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Introduction 


Welcome to Edexcel GCSE (9-1) Computer 
Science 


Computational thinking in context 

This course is designed to help you develop your knowledge and 
understanding of computer science through studying the importance of 
computation in the world and how it will evolve in the future. Through 
practical involvement in a range of activities you will apply this knowledge 
and develop the skills to help you succeed in your chosen pathway. 


There are many benefits to taking the Edexcel GCSE Computer Science course: 


¢ It has areal applied focus. You will be encouraged to put the theory you 
are learning into context and apply what you have learned to your own 
practical activities. This makes it much more fun. 


It reflects today’s world - the issues and topics you will learn about are 
up to date and will help you to understand how technology can be used to 
tackle current issues that impact on modern society. 


* You will gain a well-rounded understanding of computer science. 
Through an engaging introduction to the core principles, you will develop 
skills in problem solving and computational thinking. You will learn how 
to decompose and model aspects of real-world situations, and as a result 
be able to design, build and test a fully programmed solution to 
a problem. 


¢ You will also have the opportunity to improve your transferable skills, 
developing ‘underpinning’ concepts that are useful in many subjects and 
careers, such as mathematics, science and engineering. 

* If you do well in this course you will be in a good position to progress 
to the next level of study — whether this is an A level or a vocational 
qualification, such as a BTEC National. The content of this GCSE is ideal 
grounding for other qualifications; it has been designed using a similar 
approach to make the experience of moving on a smooth one. 


How you will be assessed 
The GCSE course ‘consists of three separate components: 


Component 1: Principles of Computer Science 
Component 2: Application of Computational Thinking 
Component 3: Project 


Components 1 and 2 are assessed by written examinations, each accounting 
for 40 per cent of the total mark. Component 3 is worth 20 per cent of the 
total mark and is internally assessed by your school. For more information 


about how you will be assessed, see the ‘Preparing for your exam’ section on 
page 226, 


Introduction 


How to use this book 


This book is organised in the same way as the Edexcel GCSE specification, with six main 
chapters plus an additional section that supports you in preparing for examinations. 


Each chapter gives you all the information you need to know and guides you through the 
content of the course in a practical and engaging way, making it clear what you will cover 
and giving you useful activities and questions to help you practise what you have learned. 


In this student book there are lots of different features. They are there to help you 
learn about the content in your course in different ways, understand it from multiple 
perspectives and get the most from your learning. 


* Learning outcomes — these Learning outcomes >» 
are listed at the beginning 


of each chapter so you know By the end of this section you should be able to: 
exactly what you are going * describe what an algorithm is. 
to learn and understand the * explain what algorithms are used for. 


ister success criteria. * express algorithms as flowcharts, pseudo-code and written descriptions. 


* use and describe the purpose of arithmetic operators. 


* Key terms - there are certain terms that you will need to know and be 
able to explain. These boxes explain what the words mean. The words 
themselves are also highlighted in the main text, and you will find all the Sequence: an ordered set of 
key terms in the Glossary at the end of the book. instructions. 


Algorithm: a precise method 
for solving a problem. It consists 
of a sequence of step-by-step 
instructions. 


* Activities — these are designed 
to build your knowledge and 
understanding and develop your 
computational and problem- 
solving skills. 


Produce a written description 
of an algorithm for getting 

to school. It should start with 
leaving home and end with 
arriving at school. For example, 
the algorithm could start with 
‘Walk to bus stop’. 


has been included to help you 


d a alt t SET correct TO ‘LetMeIn’ #The variable ‘correct’ is assigned 
a. er ‘ll of : ee the value of the password stored in 
practical skills before you try an heleyetenn 


activity yourself. 
oe FOR index FROM 1 TO 3 DO 


SEND ‘Please enter your password.’ TO DISPLAY 
RECEIVE password FROM KEYBOARD 
IF password = correct THEN 
SEND ‘You entered the correct password.’ TO DISPLAY 
END IF 
END FOR 


Introduction 


« Extend your knowledge - 
il det oai Bim these features provide ideas 


When you enter a search term into Google’, a list of links to websites is for extending the activity if you 
returned. But why are they presented in that particular order? Research would like to challenge yourself 
the PageRank algorithm that Google’ uses to rate the importance of further. 


websites and write a short report about your findings that explains how 
the order is determined. 


¢ Checkpoint - this feature 
contains questions to check your 
= — : knowledge and questions to 
$1 Explain what is meant by ‘decomposition’ and the benefits it provides challenge your understanding. 
for programmers. 


S2 Explain what is meant by ‘abstraction 
* = 


C1 Describe examples of the use of ‘decomposition’ and ‘abstraction’ 
when solving a problem. 


C2 In your own words, explain what is meant by ‘computational thinking’. 


fact that will help to give you 
The computer program that created the algorithm to travel from 


London to Glasgow was following an algorithm of its own - an algorithm 
instructing it how to create another algorithm. 


ideas to research more around 
the subject. 


a oe ei ccstion ese 
questions match the style of 


The XtraSave chain of supermarkets collects data about product sales. questions that you are likely to 
The record for each product includes: see in the written exams and will 
* its unique ID (e.g. X14 or X98); give you useful practice as you go 
* the total number sold in each month of the year. through your course. 

1 Identify a suitable data structure to store this data. (1 mark) 

2 Justify your choice of data structure. (3 marks) 

3 Drawa diagram of the data structure that shows the amount sold in the 


first three months of the year. Include data for at least two products. 
(3 marks) 


Introduction 


* Exam tip — hints and tips to aid 
your learning and help you in 
the exam. 


* Summary ~ a handy revision 
checklist of key points you 
will need to remember about 
the chapter. 


This question is testing your ability to select and design data 
structures. Make sure you have: 


* identified a suitable data structure to store the data and justified your 
choice; 


* provided the sample data requested (i.e. sales figures for two products | 
for the first three months of the year). | 
| 


Row and column headings can be included as long as they are clearly 
distinguishable from the data. 


¢ A data structure is an organised collection of related elements. Arrays 
and records are common data structures. 


* A one-dimensional array is a list of elements, each of which has a 
unique index value representing its position in the list. 


¢ A two-dimensional array is a matrix of rows and columns. Each element 
in the array has a unique pair of indices, one to identify the row and 
one the column in which it is located. 


¢ All the elements in an array have the same data type. 


¢« Arecord consists of a collection of fields. The values stored in a record 
can be of different data types. 


Preparing for your exams 


At the end of Chapter 6 there is an exam preparation section with tips and 
guidance for achieving success in your written exams. You will find example 
questions and answers, together with detailed notes and explanations 
about the quality of the answers shown. This will help you build your 
understanding of how to write better answers and achieve more marks. 
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The route on this interactive map has 


been calculated using an algorithm 


Unambiguous: this means 
that the instructions cannot be 
misunderstood. Simply saying 
‘turn’ would be ambiguous 
because you could turn left 

or right. 


All instructions given toa 
computer must be unambiguous 
or it won't know what to do. 


Sequence: an ordered set of 
instructions. 

Algorithm: a precise method for 
solving a problem. 


1.1 Algorithms 


Understanding algorithms 


Learning outcomes h 


By the end of this section you should be able to: 


¢ describe what an algorithm is. 
¢ explain what algorithms are used for. 
¢ express algorithms as flowcharts, pseudo-code and written descriptions. 


* use and describe the purpose of arithmetic operators. 


An example of an algorithm 

An interactive map is a useful way to find a route between two locations. 
This image shows a route between two cities that was calculated by a 
mapping program. 


¢ It is unambiguous in telling the driver exactly what to do, like ‘turn left’, 
‘turn right’ or ‘continue straight’. 


e |t isa sequence of steps. 
e {t can be used again and will always provide the same result. 


¢ It provides a solution to a problem, in this case how to get from London 
to Glasgow. 

A solution to a problem with these characteristics is called an algorithm. 

Most problems have more than one solution, so different algorithms can be 

created for the same problem. 


The computer program that created the algorithm to travel from 
London to Glasgow was following an algorithm of its own — an algorithm 
instructing it how to create another algorithm. 


Successful algorithms 
There are three criteria for deciding whether an algorithm is successful. 


* Accuracy - it must lead to the expected outcome (e.g. create a route from 
London to Glasgow). 


* Consistency — it must produce the same result each time it is run. 


° Efficiency — it must solve the problem in the shortest possible time using 
as few computer resources as possible. In this example the mapping 
software is replacing a manual method, and if it were no faster than 
looking in an atlas then it would not be an improvement on the older 
method. Later in the chapter there is a section on algorithms used to sort 
and search data. Some of these algorithms are more efficient than others 
and will sort the data far more quickly. 


The relationship between algorithms and programs 


Algorithms and programs are closely related, but they are not the 
same. An algorithm is a detailed design for a solution; a program is the 
implementation of that design. 


This chapter is all about algorithms. We look at how algorithms are 
implemented in high-level programming languages in Chapter 2. It’s up 
to you whether you study these two chapters sequentially or in parallel. 
So you could either study Chapter 1 followed by Chapter 2, or learn about 
algorithms and how to create them in this chapter and at the same time 
consult Chapter 2 to find out how to translate algorithms into programs. 


Displaying an algorithm 

We carry out many everyday tasks using algorithms because we are 
following a set of instructions to achieve an expected result, for example 
making a cup of coffee. If we have performed the task many times before, 
we usually carry out the instructions without thinking, but if we are doing 
something unfamiliar, such as putting together a flat-pack chest of drawers, 
then we follow the instructions very carefully. 


An algorithm can be expressed in different ways. 


Written descriptions 

A written description is the simplest way of expressing an algorithm. 
Here is an algorithm describing the everyday task of making a cup of 
instant coffee. 


Fill kettle with water. 
Turn on kettle. 

Place coffee in cup. 
Wait for water to boil. 
Pour water into cup. 
Add milk and sugar. 
Stir. 


Flowcharts 
Flowcharts can be used to represent an algorithm graphically. They provide 
a more visual display. 


There are formal symbols that have to be used in a flowchart — you can't 
just make up your own because nobody else would be able to follow your 
algorithm. 


Figure 1.1 shows the flowchart symbols that should be used. 


High-level programming 
language: a programming 
language that resembles natural 
human language. 


Flowchart: a graphical 
representation of an algorithm. 
Each step in the algorithm 
is represented by a symbol. 

- Symbols are linked together 

_ with arrows showing the order in 
which steps are executed. 


Produce a written description 


of an algorithm for getting | 
to school. It should start with 
leaving home and end with 
arriving at school. For example, | 
the algorithm could start with 
‘Walk to bus stop’. 


Indicates the start 
, orend of an 
ee eS algonthm: 


| Indicates a 
process to be 
carried out. 


Indicates a 
. decision to be 
made. 


Indicates an input 
! or output. 


Shows the logical 
flow of the 
algorithm. 


Figure 1.1 Flowchart symbols 


Start 


Fill kettle with water 


; This is a process (an action 
= that has to be performed). 


Turn on kettle 


: 


Place coffee in cup 


’ This instruction is unambiguous — 
the water must be boiling. 


Wait for kettle to boil 


The flowchart in Figure 1.2 is an alternative way of depicting 
the algorithm expressed above as a written description. 


Display the ‘journey to school’ algorithm, which you created 
in the previous activity, as a flowchart. 


A student has created a written algorithm for preparing a | 
bath. Display the following as a flowchart — you may need to | 
change the order or add actions. 


e Put in the plug. 
¢ Fill bath to the correct level. 


Stating ‘wait for the water to heat’ 


er | 


Pour water into cup 


} 


Add milk and sugar 


> eo 


Stir 


; 


End 


Figure 1.2 Flowchart of an algorithm to 
make a cup of coffee 


Pseudo-code: a structured, code- 
like language that can be used to 
describe an algorithm. 


There are many different versions 
of pseudo-code and often they are 
unique to a particular organisation 
or examination board. Investigate 
, the Edexcel pseudo-code that you 
will need for your GCSE course and 
which will be used in this book. 


would be ambiguous. How hot? 


e Check temperature is OK. 


The algorithms you have tooked at so far are designed for humans to follow. 
Algorithms also form the basis of computer programs. A computer is a 
mindless machine that simply does exactly what it is told and follows a set of 
instructions, but computers can carry out these instructions far more quickly 
than humans. That is why they are so useful. 


Pseudo-code 

In addition to flowcharts and written descriptions, algorithms can also be 
expressed in pseudo-code. The pseudo-code can then be used to code the 
solution in an actual programming language. 


It allows the developer to concentrate on the logic and efficiency of the 
algorithm without having to bother about the rules of any particular 
programming language. It is relatively straightforward to translate an 
algorithm written in pseudo-code into any high-level programming language. 


Example of a simple algorithm 


To introduce the Edexcel pseudo-code, here is a simple algorithm that asks the 
user to input two numbers and then outputs the result of adding them together. 


Written description 


Enter first number. 
Enter second number. 


Calculate total by adding first and second numbers. 
Output total. 


Start 


' 


Enter first 
number 


; 


A — - 


Enter second 
number 


' 


total = first number 
+ second number 


So 


Output 
total 


End 


Figure 1.3 Flowchart showing the 
adding of two numbers 


Variable: a ‘container’ used to 
store data. The data stored ina 
variable is referred to as a value. 
The value stored in a variable is 
not fixed. The same variable can 
store different values during the 
course of a program and each 
time a program is run. 


Identifier: a unique name given 
to a variable or a constant. Using 
descriptive names for variables 

makes code much easier to read. 


Arithmetic operator: an 
operator that performs a 
calculation on two numbers. 


Pseudo-code 


SEND ‘Please enter the first number.’ TO DISPLAY 
RECEIVE firstNumber FROM KEYBOARD 

SEND ‘Please enter the second number.’ TO DISPLAY 
RECEIVE secondNumber FROM KEYBOARD 

SET total TO firstNumber + secondNumber 

SEND total TO DISPLAY 


The pseudo-code spells out the step-by-step instructions that the 
computer will be expected to carry out. It also introduces some important 
programming concepts. 


¢ The numbers entered by the user are stored in two variables with the 
identifiers ‘firstNumber’ and ‘secondNumber’. 
¢ The result of adding the numbers together is stored in the variable ‘total’. 


* When some text is to be displayed, for example ‘Please enter the first 
number.’ it has to be enclosed in quotation marks, either single or double. 

¢ When a variable is to be displayed, the quotation marks are not used. If 
they were, then, in the last instruction, the word ‘total’ would be displayed 
and not the number it represents. 

* Arithmetic operators are used to perform calculations. This box shows 
the arithmetic operators. 


+ Addition: add the values 8+5=13 

together. myScorel + myScore2 
- Subtraction: subtract the 17 -4=13 

second value from the first. myScorel - myScore2 
= Multiplication: multiply the 6*9=54 

values together. numberBought * price 
/ Real division: divide the first 13/4=3.25 


value by the second value and —_totaiMarks/numberTests 
return the result including 


decimal places. 


DIV Quotient: like division, but it is Div4=3 
only returns the whole number — totalMarks DIV numberTests 
or integer. 
MOD Modulus/modulo: this will return 13/4 =3 remainder 1 
the remainder of a division. Therefore 13 MOD 4=1 
A Exponentiation: this is for‘to 343=27 
the power of’. It is the same as writing 3° 


Constant: a ‘container’ that holds 
a value that never changes. Like 
variables, constants have unique 
identifiers. 


Variables and constants 

Variables play an important role in algorithms and programming. 
The value stored by a variable can change as a program is running. 
Variables are extremely useful in programming because they make it 
possible for the same program to process different sets of data. 


A constant is the opposite of a variable. It is a ‘container’ that holds 
a value that always stays the same. Constants are useful for storing 
fixed information, such as the value of pi, the number of litres ina 
gallon or the number of months in a year. 


Each variable and constant in an algorithm has to have a unique 
identifier. It is important to choose descriptive names for identifiers. 
This will make your code much easier to read. For example, a 
variable to hold a user’s first name could be given the identifier 
‘firstName’ so that it is indicative of the data it contains. If it were 
given the identifier ‘X’ then it would be ambiguous what data 

it contained. 


Naming conventions for variables and constants 


It is good practice to adopt a consistent way of writing identifiers 
throughout an algorithm. 

A common convention is to use camel case for compound words 

(e.g. firstName, secondName) with no space between words and 

the second word starting with a capital letter. Alternatives are to 
capitalise the first letter of both words, e.g. FirstName, SecondName, 
or to separate the words with an underscore, e.g. first_name, second_ 
name, known as snake case. 


Here is a written description of an algorithm. 


Enter the first number. 
Enter the second number. 


The third number is equal to the first number multiplied by the 
second number. | 


Display the third number. 


Express this algorithm in pseudo-code. 


"This algorithm is displayed as a 
flowchart. 


Start 


Input 
username 


a 


: 


Does 
username 
exist? 


a | 


Input 
password 


= _ all 


Is 
password 
correct? 


NO 
— oe 


YES @ 


End 


Figure 1.4 Flowchart of an algorithm 


Produce a written description of 
this algorithm. 


Extend your knowledge * 


When you enter a search term into Google’, a list of links to websites is 
returned. But why are they presented in that particular order? Research 
the PageRank algorithm that Google’ uses to rate the importance of 
websites and write a short report about your findings that explains how 
the order is determined. 


Summary 


¢ An algorithm is a precise method for solving a problem. 


¢ Algorithms can be displayed as written descriptions, flowcharts and in 
pseudo-code. 


¢ Pseudo-code is a structured, code-like language. 
¢ Pseudo-code is translated into program code. 
e Arithmetic operators are used in calculations. 


¢ Variables and constants are ‘containers’ for storing data. The value 
stored in a variable can change, whereas the value of a constant never 
changes. 


¢ Selecting descriptive names for identifiers makes code easier to read. 


Strengthen : > » 


—— 


$1 Produce a written description of an algorithm for borrowing a book 
from the library. 


S2 Describe what each of the seven arithmetic operators does. 

$3 Describe what a variable is and explain why variables are useful. 

S4 Explain the difference between a variable and a constant. 

Challenge 

C1 Produce a flowchart describing an algorithm for making a cheese 
sandwich. 


C2 Write an algorithm expressed in pseudo-code that receives three 
numbers from the keyboard, calculates and displays the average. 


How confident do you feel about your answers to these questions? If 
you're not sure you answered them well, try the following. 


e For S1 reread page 3. 
¢ For S2 study the table on page 5. 
¢ For $3 and S4 look again at page 6. 


Se 


onstruct: a component from 
which something is built. Letters 
and numbers (i.e. a to z and 0 to 
9) are the constructs we use to 
build our language and convey 
meaning. Bricks and cement are 
the basic constructs of a building. 


Selection: a construct that allows 
a choice to be made between 
different alternatives. 


Iteration: a construct that means 
the repetition of a process. An 
action is repeated until there is a 
desired outcome or a condition is 
met. It is often referred to as 

a loop. 


Start 
Fill kettle 
with water 


aii Line 


Turn on kettle 


Is 
temperature of NO 


—- 
water = 
100°C? 
; YES 
Turn off kettle 
; The computer 
would also have 
Pour water into cup to be told how 
much water to 
: pour into the cup! 


Figure 1.5 Part of an algorithm suitable for 
a computer for making coffee 


Creating algorithms 


Learning outcomes bh. 


By the end of this section you should be able to: 
¢ create an algorithm to solve a particular problem. 
* use command sequence, selection and iteration in algorithms. 


Algorithms for computers 

There was an ambiguous statement in the algorithm for making a cup 
of coffee on page 3. After filling the kettle with water and adding coffee 
to the cup, the next instruction was ‘Wait for water to boil’. 


A human can interpret this instruction as meaning that they have to 
keep checking the kettle over and over again until the water is boiling. 
But a computer is unable to interpret an instruction like that. It would 
just wait. And wait. Forever. 


Even worse, the algorithm didn’t explicitly say how to determine if the 
water was boiling. Through experience we humans assume the water is 
boiling when there is lots of steam, sound and bubbles; or, even better, 
when the kettle turns itself off. An algorithm for a computer would 
have to state that it was waiting until the water reached 100°C. 


A version of this part of the algorithm, suitable for a computer, is 
shown in Figure 1.5. 


This example introduces two new constructs from which algorithms 
are created. 


We have already met the construct command sequence — step-by-step 
instructions in the correct order. To add to this we now have selection 
and iteration. 


We use iteration in our daily lives whenever we carry out an 
action over and over again. For example, at mealtimes we keep 
on eating until our plate is empty or we have had enough to eat. 


When we're travelling by car and the traffic lights are red we have 
to keep waiting until they change to green. 


An actor repeats their lines over and over again until they 
are word perfect. 


Representing selection and iteration in a flowchart 
Selection and iteration are represented in a flowchart as shown in Figure 1.6. 


7 These arrows represent the iteration. 

, If the answer is ‘NO’ then the selection 
There is a question Is question is repeated until the answer is ‘YES’ 
with two alternatives. temperature of NO » the desired outcome. 
This represents the = 
selection. ai 

YES 
_ 


Figure 1.6 Selection and iteration in a flowchart 


Representing selection and iteration in pseudo-code a eS 
Selection IF... THEN...ELSE statement: 


Selection in pseudo-code is represented exactly as we would say it using the IF... THEN...ELSE statement 
an IF...THEN...ELSE statement. allows a choice to be made 


between two alternatives based 


_ : met (e.g. IF it is cold THEN wear a 
eee — wT en jumper ELSE wear a T-shirt). 
Switch off kettle 


ESE 
Keep waiting 
END IF 


Relational operator: an operator 
, that compares two values. 


| 
| 


This is an oversimplification as the commands ‘Switch off kettle’ and 
‘Keep waiting’ mean nothing to a computer. 


An IF...THEN statement can be used without an ELSE if there is only one 
course of action to be taken, providing the condition in the IF statement 
is met. 


IF score >= 90 THEN 
SEND ‘Excellent’ TO DISPLAY 
END IF 


For selection, relational operators are used to compare the values. 


Relational operators are used to compare two values. The operators that 
you will be using are: 

= equal to 

> greater than 

>= greater than or equal to 

. less than 

<= less than or equal to 

<> not equal to 


SET passMark TO 75 
RECEIVE mark FROM KEYBOARD 
IF mark >= passMark THEN 

SEND ‘Well done. You’ve passed.’ TO DISPLAY 
ELSE 

SEND ‘Bad luck. You've failed.’ TO DISPLAY 
END IF 


Notice the use of indentation in an IF... THEN...ELSE statement. 

In this example there is only one statement for each alternative, 
but imagine if there were many. The indentation makes it easier 

to see which statement(s) belong to each alternative. Indentation 

is a useful technique for improving the readability of algorithms 
expressed in pseudo-code. You should get into the habit of using it. 
When you move on to implementing your algorithms in a high-level 
programming language you might find that the computer won't 

be able to execute your programs unless you have used 

indentation correctly. 


Activity 7 


IF score <= highScore THEN 


SEND ‘You haven’t beaten your high score.’ TO DISPLAY 
ELSE 


SEND ‘You've exceeded your high score!’ TO DISPLAY 
END IF 


State the output of the algorithm when 
¢ score = 5 and highScore = 10 

* score = 20 and highScore = 10 

* score = 15 and highScore = 15 
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Activity 8 


A driving school uses this rule to estimate how many lessons a learner 
will require. 


_ © Every learner requires at least 20 lessons. 


' ¢ Learners over the age of 18 require more lessons (two additional 
lessons for each year over 18). 


Create an algorithm expressed in pseudo-code that inputs a learner’s age 
_ and calculates the number of driving lessons they will need. 


Nested selection 
The IF... THEN...ELSE statement allows a choice to be made between Keven a 
two possible alternatives. However, sometimes there are more than two 

ee ithe ; Nested IF statement: a nested 
possibilities. This is where a nested IF statement comes in useful. 


_ IF statement consists of one or 
more IF statements placed inside 
A learner handed in three homework assignments, which were each given pi Lads adap pare Ps 
a mark out of 10. All the marks were different. Write an algorithm that Saal Soe oc cuk 
would print out the highest mark. 


Figure 1.7 shows the algorithm expressed as a flowchart: 


Input mark1, 
mark2, mark3 


Is Is 


NO NO 
markl > — mark2 > a 
mark2? mark3? 
; YES j YES 
7 = Output Output 
“ mark2 mark3 
amarante me mark1 > : : 
mark3 mark3? ; ; 
- . 
: ad End | End 
End - ; 
Output 
mark] 
Evel Figure 1.7 Flowchart of an algorithm to 


print out the highest homework mark 
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When you are creating nested IF 
statements you have to ensure 
that each one is completed 

with an END IF statement at the 
correct indentation level. Some 
programming languages do not 
need an END IF statement and 
just use the indentation levels to 
indicate when statements 

are grouped. 


The variables mark1 and mark2 are compared using a relational operator. 
If mark1 is greater than (>) mark2 then it is compared with mark3. If it is 
not greater than mark2, then mark2 must be greater than mark1 and it is 
then compared with mark3. 

This algorithm can also be expressed in pseudo-code: 

RECEIVE markl FROM KEYBOARD 


RECEIVE mark2 FROM KEYBOARD 
RECEIVE mark3 FROM KEYBOARD 
IF markl > mark2 THEN 


IF markl > mark3 THEN #This is an IF statement 
within another IF statement. 
It is called a nested IF. 


SEND markl TO DISPLAY 
ILS a 
SEND mark3 TO DISPLAY 
END IF 
ELSE 
IF mark2 > mark3 THEN #This is another nested IF 
statement. 
SEND mark2 TO DISPLAY 
ELSE 
SEND mark3 TO DISPLAY 
BND TF 
END IF 


In this algorithm there are three IF... THEN...ELSE statements. Two of them 
are completely nested within the outer one. 


The # symbol indicates a comment. This is some text used to explain the 
code and the # symbol shows that it is not to be executed. It can be on 

a line of its own or at the end of the line to which it applies. You should 
get into the habit of adding comments to your algorithms to explain how 
they work and should do the same when writing program code. 


A learner is creating a guessing game. A player has to enter a number no 
greater than 10. If it is too high, they are informed that they have made an 
error, but if it is within the range 1 to 10, they are told whether or not they 
have guessed the correct number. (Assume that the correct number is 3.) 
Create an algorithm to solve this problem and express it as a flowchart 
and in pseudo-code. 


= 
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, Aschool uses this algorithm to calculate the grade learners achieve in 
end-of-topic tests. 


RECEIVE testScore FROM KEYBOARD 
IF testScore >= 80 THEN 
Sanp “A’ “TO DISPRAY 
ELSE 
IF testScore >= 70 THEN 
SEND ‘B’ TO DISPLAY 


ELSE 
IF testScore >= 60 THEN 
SEND ‘C’ TO DISPLAY 
ELSE 
IF testScore > 0 THEN 
| SEND 'D' TO DISPLAY 
| ELSE 
| SEND 'FAIL' TO DISPLAY 
END IF 
END IF 
END IF 
END IF 


What would be the output of this algorithm for these test scores: 91, 56 
and 78? 


Iteration 

When writing programs it is often necessary to repeat the same set of 
statements several times. Rather than simply making multiple copies of the 
statements you can use iteration to repeat them. The algorithm for making 
a cup of coffee includes an instruction to keep waiting until the water in the 
kettle boils. 


IF temperature = 100°C THEN 
Switch off kettle 

ESE 
Keep waiting 

END IF 


Selection would be useless without iteration. The question would be asked 
once and then the program would move on or just stop. There has to be a 
method for repeating the question until there is a desired outcome. Ina 
flowchart this is easy to implement - you just have to draw some arrows. 


In both pseudo-code and program code you have to construct a loop, 
or iteration. There are two types of iteration: indefinite iteration and 


definite iteration. 
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"Indefinite iteration: this is used 
when the number of iterations 
is not known before the loop is 
started. The iterations stop when 
a specified condition is met. 
This sort of loop is said to be 
condition controlled. 


Definite iteration: this is used 
when the number of iterations, 
or turns of the loop, is known 
in advance. It can be set to as 
many turns as you want. This 
sort of loop is said to be count 
controlled. 


It is good practice to indent 

commands that occur within 
a statement block because it 
makes the algorithm clearer. 


Use the pseudo-code command 
SET to assign a value between 1 
and 6 to diceRoll before the start 
of the loop. 


When using a WHILE...DO 

loop, the statement RECEIVE 
guess FROM KEYBOARD, which 
allows the user to enter their 
guess, should be included in the 
algorithm twice — once before the 
start of the loop and once inside 
the loop. 


Indefinite iteration 
Obviously in this example it is not known how many times the program will 
have to check until the water temperature reaches 100°C. 


Therefore we have to use indefinite iteration. There are two ways of doing this 
in pseudo-code: you can use a REPEAT...UNTIL loop or a WHILE...DO loop. 


A REPEAT...UNTIL loop checks the condition when it gets to the end of 

the loop. This means that the statements contained within the loop will be 
executed at least once. A WHILE...DO loop checks the condition at the start 
of the loop so in some circumstances the statements contained within the 
loop will not be executed. 


REPEAT #This starts the REPEAT...UNTIL loop. 


RECEIVE temp FROM SENSOR #The temperature of the water is 
input from a temperature sensor. 
UNTIL temp = 100 #This sets the condition for the loop 
to stop. When ‘temp’ is equal to 100 
the loop will stop. 
Switch off kettle #This is the next command to be 
executed when the loop has finished. 


Using a WHILE...DO loop the algorithm shown above would be 


RECEIVE temp FROM SENSOR 

WHILE temp < 100 DO 
RECEIVE temp FROM SENSOR 

END WHILE 

Switch off kettle 


1 Write an algorithm expressed in pseudo-code that asks the user to enter 
a number between 1 and 6. If the number entered matches the value 
stored in the variable diceRoll the message ‘Well done you guessed | 
correctly.’ is displayed. Otherwise the user is invited to guess again. 


Use a WHILE...DO loop and include comments to explain what each 
line of code does. 


| 


2 Produce a second version of the algorithm using a REPEAT..-UNTIL loop. 


Definite iteration 


This is used when the number of iterations, or turns of the loop, is known 
in advance. 


One method is to use a REPEAT...TIMES loop. 
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REPEAT 50 TIMES 
SEND ‘*’ TO DISPLAY 
END REPEAT 


A FOR loop is another method of repeating a sequence of instructions a fixed 
number of times. 


A learner is designing a program to help younger children with their times 
tables. When a user enters a number the program will output the times 
table up to 12. 


RECEIVE number FROM KEYBOARD #The number entered is assigned 
to the variable ‘number’. 
FOR index FROM 1 TO 12 DO #The loop is set up using the 


variable ‘index’ which will 
change from 1 to 12 at each 
turn of the loop. 

SEND number * index TO DISPLAY #The value of ‘number’ is 
multiplied by the value of ‘index’ 
at each turn of the loop. 

END FOR #This command is used to close 
the loop. 


This loop will be repeated 12 times. At each turn of the loop the variable 
‘index’ is incremented by one. 


Variables in a FOR loop can be used to indicate the start and end values of 
the loop, for example 
start = 25 
finish = 50 
FOR index FROM start TO finish DO 
SEND ‘Congratulations’ TO DISPLAY 
END FOR 


_ Create an algorithm expressed in pseudo-code that asks a user to enter 

| a start number and an end number and then outputs the total of all the 
numbers in the range. For example, if the start number was 1 and the end 
number was 10, the total would be 55 (1+2+3+4+5+6+7+8+9+ 10). 
Tip: 


You should initialise the variable total to zero before the start of the loop. 
see ‘ —- — 
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| Logical operator: a Boolean 
operator using AND, OR and NOT. 


Another example of a fixed number of iterations is when you have to enter a 
password and you only get three attempts. It could be implemented in the 


following way. 


SET correct TO ‘LetMeIn' #The variable ‘correct’ is assigned 
the value of the password stored in 


the system. 


FOR index FROM 1 TO 3 DO 
SEND ‘Please enter your password.’ TO DISPLAY 
RECEIVE password FROM KEYBOARD 
IF password = correct THEN 
SEND ‘You entered the correct password.’ TO DISPLAY 
END IF 
END FOR 


The loop will ask for the password to be input three times, but what if the 
user gets the password correct on the first attempt? They do not want to 
have to enter it twice more. 


Another solution would be to use indefinite iteration and keep count of the 
number of iterations. 


SEL counteTO.0 #The variable ‘count’ is assigned 
the value 0. 
SET correct TO ‘LetMeIn’ #The variable ‘correct’ is assigned 


the value of the password stored 
in the system. 


REPEAT 
Sileeoune to count + 1 #The variable ‘count’ is 
incremented by 1 on each turn. 
SEND ‘Please enter your password.’ TO DISPLAY 
RECEIVE password FROM KEYBOARD 
IF password = correct THEN 
SEND ‘Correct password.’ TO DISPLAY 
ELSE 
SEND ‘Incorrect password.’ TO DISPLAY 
END IF 
UNTIL password = correct OR count = 3 


The loop will end if either of the conditions is met — if the password is 
correct or the number of attempts is equal to 3. 


We have used a compound comparison by joining two conditions together 
using an ‘OR’, which is a logical operator. 


16 


AND _ If two conditions are joined by the ‘AND’ operator, then they must 
both be true for the whole statement to be true. 

OR If two conditions are joined by the ‘OR’ operator, then either one 
must be true for the whole statement to be true. 

NOT The NOT operator reverses the logic of the AND and OR statements. 


The statement 'IF A = 3 AND B = 6' will be true only if the conditions 
are met, i.e. A and B are both equal to the values stated. 

The statement IF NOT (A = 3 AND B = 6) will be true whenever 

both A and B are NOT equal to the values stated i.e. either or both 
are not equal to those values. 

The NOT operator is often used when setting a selection criterion, | 
for example IF NOT LENGTH(password) > 8 THEN to ensure a 
minimum password length. 


Activity 13 

In Activity 9 you had to create an algorithm for a guessing game. This game 

is more difficult. It should generate a random number between 1 and 20. 

e Ask the user to guess the number. 

e Allow the user three attempts. 

e Display a message if the attempt is correct. 

e Display a message if the attempt is incorrect and inform the player if 
their attempt is too high or too low. 

¢ Display a message to the player after three incorrect attempts 
informing them of the correct number. 


Nested loops 

A nested loop comprises a loop within a loop. When one loop is nested 
within another, each iteration of the outer loop causes the inner loop to be 
executed until completion. 


In this example a nested loop is used to calculate and display the average 
mark achieved by each student in a group of twenty for a series of five tests. 


The outer loop iterates through each student in turn. The inner loop receives 
each set of five marks and adds them together. The outer loop calculates 
and displays the average mark before moving on to the next student. 


FOR student = 1 TO 20 
SET sum = 0 
FOR mark = 1 TO 5 


RECEIVE nextMark FROM KEYBOARD 
SET sum TO sum + nextMark 
END FOR 
SET averageMark TO sum/5 
SEND averageMark TO DISPLAY 
END FOR 
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Random number: a number within — 
a given range of numbers that is 
generated in such a way that each 
number in the range has an equal 
chance of occurring. 

There are many devices for 
generating random numbers. A die 
is used in games to get a random 


~ number from 1 to 6. Computer 


programming languages have a 
function for generating random 
numbers across variable ranges. 


In the Edexcel pseudo-code 
there is a useful built-in RANDOM 
command. 


RANDOM (upperLimit ) 


For example, number = RANDOM(6) 
would generate a random number 
from the numbers 1 to 6. 


Nested loop: a loop that 


runs inside another loop. The 
inner one executes all of its 


_ instructions for each turn of the 


outer loop. 


| Concatenation: the linking 
together of two or more items of 
information. 


Different symbols are used 

in other pseudo-codes and 
programming languages, with 
the ‘+’ symbol being the most 
popular. 


A learner has a Saturday job selling cups of tea and coffee. The tea is 
£1.20 per cup and the coffee is £1.90. He is supposed to keep a record of 
the number of cups of each he sells. 

Unfortunately he has been so busy that he has lost count but he knows 
that he did not sell more than 100 of each. 


He has collected £285. 

Write a program that will calculate how many cups of tea and coffee he sold. 
SET teaCost TO 1.2 

SET coffeeCost TO 1.9 

FOR numCoffees FROM 1 TO 100 DO 


FOR numTeas FROM 1 TO 100 DO #This loop for the teas is 
nested inside the loop for 


coffees. 
SET total TO (numCoffees * coffeeCost) + 
(numTeas * teaCost) 
IF total = 285 THEN 
SET teas TO numTeas’ #These ‘teas’ and ‘coffees’ 
variables are needed as 
the loops will continue and 
‘numCoffees’ and ‘numTeas 
will change. 
SET coffees TO numCoffees 
END) IF 
END FOR 
END FOR 


SEND ‘The number of teas is’ & teas & ‘and the number of 
coffees is’ & coffees TO DISPLAY 


: 


In this SEND command, four items of information are displayed. First there 
is some literal text enclosed in quotation marks — ‘The number of teas 

is’ — followed by the value of the variable teas, followed by some more 

literal text — ‘and the number of coffees is’ — followed by the value of the 
variable coffees. The four are joined by an ‘&’ symbol - known as the append 
operator. Joining items of information in this way is called concatenation. 


If this algorithm was converted into program code and executed this 
sentence would be displayed on the screen: 


‘The number of teas is 95 and the number of coffees is 90’ 


Create an algorithm that will print out the times tables (up to 12 times) 
| for the numbers 2 to 12. You should use concatenation so that the 
| printouts State 2X 2=4, 2X3=6 , 9 Sage cece 
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$1 Explain, using examples, how command sequence, selection and iteration are used in algorithms. 
$2 Explain what each of the six relational operators does. 


C1 Design an algorithm in pseudo-code that asks the user to enter their height (in metres) and weight (in 
kilograms) and displays their body mass index (BMI). The formula for calculating BMI is weight / height’. 


Comments make algorithms easier to understand. The # symbol indicates a comment in pseudo-code. 
Concatenation is the linking together of two or more items of information. 


— 


C2 Design an algorithm expressed as a flowchart to control the heating in a house. A thermostat monitors the 


temperature within the house. During the week the temperature should be 20°C between 6.00 and 8.30 in the 
morning and between 17.30 and 22.00 at night. At weekends it should be 22°C between 8.00 and 23.00. If the 
temperature in the house falls below 10°C at any time the boiler is switched on. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try the following activities again. 


e For S1 have a look at the key terms on pages 2 and 8. 
e For S2 have a look at the table on page 10. 
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* The constructs command sequence, selection and iteration are the basic building blocks of algorithms. 
¢ Nested IF statements allow for more than two alternatives. 

¢ There are two types of iteration — definite and indefinite. 

Another name for iteration is loop. Loops can be nested. 


Working with algorithms 


Learning outcomes hh 


By the end of this section you should be able to: 


* describe the purpose of a given algorithm and explain how it works. 


¢ determine the correct output of an algorithm for a given set of data. 


¢ identify and correct errors in algorithms. 


Start 


’ 


Input length, 
width 


. 


area = 
length * width 


End 


Figure 1.8 Flowchart of an algorithm 
showing area of a rectangle 


The purpose of an algorithm 
When you look at an algorithm, expressed either in pseudo-code or as a 
flowchart, it is sometimes easy to see its purpose. 


Have a look at the algorithm in Figure 1.8. 
The purpose of this algorithm is to find the area of a rectangle. It works like this. 


¢ The length of the rectangle is input and is stored in the variable ‘length’. 

¢ The width of the rectangle is input and is stored in the variable ‘width’. 

¢ The variable ‘length’ is multiplied by the variable ‘width’ to find the area 
of the rectangle, which is stored in the variable ‘area’. 

e The value of the variable ‘area’ is output. 


Here is another algorithm, this time displayed in pseudo-code. 


Giaae woneellistope GMO) @ 


SET totalAgainst TO 0 #These variables hold the running totals 
and are initialised to 0 at the start of the 
algorithm. 

SET win TO 0 

SET loss TO 0 

SET draw TO 0 

SET anotherEntry TO ‘Y’ 


WHILE anotherEntry = ‘Y’ DO 
SEND ‘Enter goals for:’ TO DISPLAY 
RECEIVE goalsFor FROM KEYBOARD 
SEND ‘Enter goals against:’ TO DISPLAY 
RECEIVE goalsAgainst FROM KEYBOARD 
SET totalFor TO totalFor + goalsFOR 
SET totalAgainst TO totalAgainst + goalsAgainst 
IF goalsFor > goalsAgainst THEN 
SET win TO win + 1 


ELSE 
IF goalsFOR < goalsAgainst THEN 
SET loss TO loss + 1 


ELSE 
SET draw TO draw + 1 
END IF 
END IF 
SEND “Press ‘Y' to enter another result.’ TO DISPLAY 


RECEIVE anotherEntry from KEYBOARD 
END WHILE 
SEND 
SEND 
SEND 


& win TO DISPLAY 
& loss TO DISPLAY 
& draw TO DISPLAY 


ENG tage wins + 
‘Total losses: ' 
‘Total draws: ‘ 


So what can we say about this algorithm? 


Its purpose is to allow a user to enter match scores for a particular team 
and sport. It could be something like football or hockey. 


It calculates the total goals that were scored for and against the team. 

It calculates the number of matches that were won, lost and drawn. 

It displays the number of wins, losses and draws to the user. 

The variables to hold these values are initialised - declared and assigned 
a start value at the beginning of the algorithm. 

The algorithm uses indefinite iteration. It loops until the user presses any 
key other than the ‘Y’ key. 


Checking the output 

A good way to follow the reasoning behind an algorithm, and also to find 
any logic errors, is to use some sample data and check if the output is what 
you expect. 


Use the following scores as test data. 

9) 

3-2 

0-2 

1-1 

Calculate the expected end states of the variables from these results. 


Track the entry of each score through the algorithm and see if they are 
the same as your expected results. 


Trace tables 
The formal way of checking the logic of an algorithm is to use a trace table. 
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In the algorithm the SEND 
statement at the end of the loop 
uses double quotation marks. 
This is because the text to be 
displayed includes the letter ‘Y’ in 
single quotes. 


You can use double quotation 
marks in a SEND statement 
where the text includes single 
quotes or an apostrophe. 
Pseudo-code is very forgiving, 
so it doesn’t matter if you don’t 
do this, but it does matter when 
the algorithm is converted into 
program code. 


Logic error: an error in an 


algorithm that results in incorrect 
or unexpected behaviour. 


Trace table: a technique used 

to identify any logic errors 

in algorithms. Each column 
represents a variable or output and 
each row a value of that variable. 


Create a trace table for the following algorithm. 
SET number TO 3 
FOR index FROM 1 TO 5 DO 

SET numberl TO number * index 

SET number2 TO numberl * 2 

IF number2 > 20 THEN 

SEND number2 TO DISPLAY 

END IF 
END FOR 
This algorithm can be traced in the following table. 


3 

3 1 6 

3 Zz 12 

3 3 18 

3 4. i2 24 24 
3 5 15 30 30 


The value of the variable number remains at 3 throughout, but as the index 
increases from 1 to 5, then so do the values of numberl and numbere. 


When the value of numberz is greater than 20, its value is output. 


| Activity 16 
Complete a trace table for this algorithm. 
Sh onumberd 10 2 
SET number2 TO 3 
FOR index FROM 1 TO 5 DO 
SET numberl TO numberl * index 


SET number2 TO number2 + numberl 
END FOR 


Spend some time studying the algorithm to ensure that you fully 
understand it. 


In 1 you are asked to ‘explain’ how the algorithm works. A longer answer 
is required that includes all of the stages of the algorithm. Use the correct 
terms to explain the constructs. 

In 2 and 3 short answers are sufficient. 


In 4 calculate the charge for each person using the rules of the algorithm. 
Then calculate the overall charge and check to see if the family qualifies 
for a group discount. 
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This flowchart displays an algorithm used by 
Holiday Theme Parks Limited. 


Start 


charge =0 
total =0 
number = 0 
- 
. 


Input 
customer 


charge = £10 
number = number + 1 


total = total ~ charge 


Is 
there another 
customer in the 
group? 


MES 


— charge = £5 
number = number - 1 
: 
YES 
= charge £9 
7B 

1 Describe how the algorithm calculates the 
total amount that should be paid. 

2. Give two variables that are used in the 

YES algorithm. 

3 In the flowchart, two of the constructs 
are labelled A and B. State the type of 
each construct. 

_ 4 The Smith family is visiting the park. The 
~~ total =total - £10 family consists of two children, one aged 
8 and one aged 10, their two parents and 

: their grandfather, who is aged 65. Use 


the algorithm to calculate how much the 
family should have to pay for entry. 


Figure 1.9 Flowchart of an algorithm showing charges in a theme park 
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| 


Simulation: a representation of a 
real-world process or system. 


| Activity 17 
Here is part of a simulation for a payment system at a car park. 


SET parkCharge to RANDOM(20) #As this is a simulation, a parking 
charge is generated as a random 
number between 1 and 20, i.e. £1 
and £20. 


SET payment TO 0 


WHILE payment < parkCharge OR payment > 20 DO 
SEND ‘The charge is ‘ & parkCharge TO DISPLAY 
SEND ‘Enter payment up to £20 maximum.’ TO DISPLAY 
RECEIVE payment FROM KEYBOARD 
SET changeDue TO payment - parkCharge 
WHILE changeDue >= 10 DO 

SEND “£10 note’ TO DISPLAY 

SET changeDue TO changeDue - 10 
END WHILE 
WHILE changeDue >= 5 DO 

SEND “L5enotve “20 DISPLAY 

SET changeDue TO changeDue - 5 
END WHILE 
WHILE changeDue >= 2 DO 

SEND ‘£2 coin’ TO DISPLAY 

SET changeDue TO changeDue - 2 
END WHILE 

END WHILE 


What is the maximum charge at the car park? 


Explain how the algorithm calculates the change due and decides how 
many notes or coins should be given. Complete the algorithm so that it 
includes the payment of £1, 50p, 20p, 10p, 5p, 2p and 1p coins and add 
comments to show how the code works. If the charge is £6.90 and the 
person pays with a £20 note, use the completed algorithm to determine 
how many of each type of denomination (£20, £10 and £5 notes and £2, 


£1, 50p, 20p, 10p, 5p, 2p and 1p coins) will be issued. 


Identifying errors 
There are three types of error in computer programs. 


* Syntax errors occur when algorithms are being converted into program 
code. 


* Runtime errors occur when the program is executed. 
* Logic errors are errors in the design of algorithms. 
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Syntax and runtime errors will be covered in Chapter 2. Pseudo-code 
purposefully doesn’t really have any syntax so that the developer can 
concentrate on noting down the logic without having to be bothered 
with the rules of syntax. Obviously, you won't have syntax or runtime 
errors in pseudo-code. 


Here is an algorithm to find the average of two numbers. 

RECEIVE numberl FROM KEYBOARD 

RECEIVE number2 FROM KEYBOARD 

SET average TO numberl + number2 / 2 

SEND average TO DISPLAY 

This seems logical. Two numbers are input, they are added together and 
then they are divided by 2. 

However, if this algorithm was given 12 and 6 as the two numbers it 
would return 15 as the average instead of 9. There is a logic error. 
Instead of adding the two numbers and then dividing by two, as the 


So far you have been using the 
Edexcel pseudo-code that is 
provided in the specification. In 
the examination, questions will 
be asked using this version, but 
if you have to answer a question 
by writing pseudo-code, you do 
not have to use Edexcel’s version. 
As long as your answer is logical 
and can be understood by a 
competent person, then it will 


developer intended, it is dividing the second number by 2 and then be accepted. 
adding the result to the first number. 
The developer should have written the third line as 


SET average TO (numberl + number2) / 2 


In computer programming the order of precedence (the order in which 
you do each calculation) is the same as in mathematics and science — 


BIDMAS. 

This is how 32 x 9 + (5 - 2) would be evaluated. 
Brackets 3o a(S) 

Indices 9x 9 + (3) 

Division 

Multiplication 81 + (3) 

Addition 84 


Subtraction 

To calculate 24/3 — 2, the division would be calculated before the 
subtraction. 

24/3 =8 

8-2=6 


The best way to find logic errors is to use the technique in the section above 
~ use sample data and check if the actual output from the algorithm is as 
expected. You'll learn more about this in Chapter 2. 


Infinite loop: a loop that is 


, never-ending since the condition 


| 


required to terminate the loop is 
never reached. 


“J 


Activity 18 
This is part of a larger algorithm = 
designed to ensure that input data Start 
falls within a certain range. It is part : 
of a school management system and 
checks that the ‘year group’ entry is Input 
acceptable. It has learners aged 11 to yearGroup 
18 years with year groups of 7 to 13. : 
The staff using the system 
congratulated themselves on never a ' 
4 earGrou = 
making an error when entering the Soret 
year group, but when learner lists | 
were printed out they immediately EZ | 
received complaints. , 
What is the logic error in the ; | 
algorithm? Figure 1.10 Flowchart showing an | 
error in an algorithm | 


Often logic errors occur when designing loops. 


Study this algorithm. 
WHILE index < 10 DO 
SET index TO 1 
SEND index TO DISPLAY 
SET index TO index + 1 
END WHILE 
The expected output is 1, 2, 3, 4, 5, 6, 7, 8, 9. 
But there is a logic error. The variable ‘index’ is initialised in the wrong 
place. It should be done before the start of the WHILE loop. 


This algorithm would loop forever because at each turn in the loop the 
variable index is set to 1. It will never reach 10. This is an example of an 
‘infinite loop’. The algorithm should have been written as shown below. 
SET index TO 1 
WHILE index < 10 

SEND index TO DISPLAY 

SET index TO index + 1 
END WHILE 
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' Find and correct the errors in these algorithms. 


. © Example 1 


SET index TO 1 
WHILE index < 10 

SEND index TO DISPLAY 
END WHILE 


¢« Example 2 


SET index TO 1 
WHILE index < 10 
SEND index TO DISPLAY 
SET index TO index - 1 
END WHILE 


| « Example 3 


SEP imgexeTO 1 
WHILE index < 1 
SEND index TO DISPLAY 
SET index TO index + 1 
END WHILE 


¢ A logic error in an algorithm will produce an incorrect result. 


¢ Tracing the value of variables in an algorithm helps to identify logic errors. 
¢ Understanding the order of precedence of arithmetic operators and the significance of brackets will help you 


to avoid making logic errors. 


S1 Describe the purpose of an algorithm and explain how it works. 


$2 Explain, using examples, what a logic error is. 


S3 Use a trace table to check the output of an algorithm and identify any logic errors. Explain what BIDMAS 


means and demonstrate how this expression would be evaluated. 
ceo e+ (8 — 3) 


ral 


Challenge 


C1 State the purpose of this algorithm and explain the rules behind the calculation. 


SEND ‘Enter the weight of your parcel in kilograms’ 
TO DISPIAY 
RECEIVE parcelWeight FROM KEYBOARD 
IF parcelWeight <= 2 THEN 

postage = 8 
ELSE 

IF parcelWeight <= 10 THEN 

postage = 8 + ((parcelWeight — 2) * 2.5) 


ELSE 
postage = 8 + (8 * 2.5) + ((parcelWeight — 10) * 
So) 
IF END 
IF END 
SEND ‘The cost of posting your parcel is ‘ & postage 
TO DISPLAY 


C2 People often want to know the human-equivalent age of their dog or cat. The rules for calculating this are: 
A 1-year-old cat is equivalent in age to a 15-year-old human, a 2-year-old cat is equivalent in age to a 24-year- 
old human. Add four years for every year after that. 
A 1-year-old dog is equivalent in age to a 12-year-old human, a 2-year-old dog is equivalent in age toa 
24-year-old human. Add four years for every year after that. 
Design an algorithm expressed in pseudo-code that allows the user to input the age of their dog or cat and 
outputs its equivalent human age. It should allow the user to have another go 
Use a trace table and test data to check the logic of your algorithm. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try the following activities again. 


¢ For S1 reread pages 20-21 and have another go at all the activities. 


* For S2 and S3 reread the section on trace tables on pages 21-22 and the 
section on identifying errors on pages 24-25. 
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Sorting and searching algorithms 


By the end of this section you should be able to: 


* explain the following sorting algorithms and be able to apply them. 
o Bubble sort 
o Merge sort 


* explain the following searching algorithms and be able to apply them. 
o Linear search 
o Binary search 


¢ explain how the choice of algorithm is influenced by the data structures and data values that need to be 
manipulated. 


evaluate the fitness for purpose of algorithms in meeting specified requirements efficiently using logical 
reasoning and test data. 


Two of the most common tasks in computer programs are sorting data into 
a particular order and searching for particular items of information. 


There might be millions of items of stored data and searching for 
information would be very inefficient if the data was not sorted. Imagine 
the confusion and difficulty of having to find something in a dictionary 
that wasn't in alphabetical order or planning a trip with train timetables 
that weren't sorted into time order. Even small lists such as football league 
tables or the top-20 music charts are much more useful if they are sorted 
into order. 


Arrays 

All sorting and searching algorithms work on lists of data. As you already 
know, a variable stores just a single value (e.g. SET age TO 15 or SET 
firstName TO ‘David)). 


But what if a programmer wants to store lots of related values, such as the 
names of a group of friends? A tedious way of doing it might look something 
like this. 


Sen freend2 TO ‘Barry’ 
' ; Array: an organised collection of 
Sart friiend3 TO ‘Cathexane’ : 
related values that share a single 


It would be much easier if all the names could be stored in one list to which denuner. 
the names of new friends can be added. That's exactly what an array allows 
you to do. 


An array with the identifier ‘arrayFriends’ would store the data items 
like this. 


arrayFriends = [‘Alice’, ‘Barry’, ‘Catherine’ ] 


This array has three elements. 


Pseudo-code has a built-in 
LENGTH command, which you 
can use to find the number 

of elements in an array. For 
example, SET size TO 

LENGTH (arrayFriends) would 
set the variable ‘size’ to 5. 


Remember: the elements have 
indexes 0 to 4. 


¢ Use the LENGTH function to 
find out how many marks 
there are altogether. 


e Use a variable maxMark — set 
initially to 0 - to find the 
highest mark. 


¢ Each mark in the array is 
compared in turn with the 
value stored in maxMark. 


¢ If itis higher then maxMark is 
updated. 


Ascending order: this is 
arranging items from smallest to 
largest (e.g. 1, 2,3, 4, 5,60ra, b, 
c,d, e, f). 

Descending order: this is 
arranging items from largest to 
smallest (e.g. 6,5, 4, 3,2, lorf,e, 
dG. Daa). 


Array indexes 

Each element in an array has an index number. In the example above the 
index number for ‘Alice’ is 0 and the index number for ‘Catherine’ is 2. (Index 
numbering starts at 0.) 


A new element can be inserted into an array like this. 


SET arrayFriends[3] TO ‘David’ 
SET arrayFriends[4] TO ‘Eva’ 


This would insert the name ‘David’ at index position 3 and the name ‘Eva’ at 
index position 4 — the fourth and fifth positions in the array. 


This code will traverse an array named ‘arrayFriends’ and print out each 

element of the array. 

FOR index FROM 0 TO LENGTH(arrayFriends) — 1 DO 
#The loop has to run to the length of 
the array minus 1 as indexing starts at 
0. For example, if there are 10 items 
then they will be indexed as 0 to 9 
(ory: 

SEND arrayFriends[index] TO DISPLAY 
END FOR 


Activity ral) 


An array named ‘arrayScores’ contains a set of marks that a learner has 
obtained during her computer science course. 


Create an algorithm expressed in pseudo-code to find and display her 
highest mark and her average mark. 


Activity 19 would have been a whole lot easier if the elements of the array 
had been sorted into order. We will next be looking at two ways this can be 
done. 


Sorting algorithms 


As sorting is such a widely used procedure, many algorithms have been 
created to carry it out. As with all algorithms, some are more efficient 
than others. 


Bubble sort 


When data is sorted, different items must be compared with each other and 
moved so that they are in either ascending order or descending order. 


The bubble sort algorithm starts at one end of the list and compares pairs of 
data items. If they are in the wrong order, they are swapped. The comparison 
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of pairs continues to the end of the list, each complete traversal of the list 
being called a pass. This process is repeated until there have been no swaps 
during a pass, indicating that the items must all be in the correct order. Traversal: travel across or 


, ; through something. An array can 
The algorithm can be described as follows. be es by Baie Na the 


‘Bubble sort (ascending order) |) fretto the ost clement in over 
' to examine the data stored at 
1 Start at the beginning of the list. 
2 Compare the values in position 1 and position 2 in the list — if they are 
not in ascending order then swap them. 
3 Compare the values in position 2 and position 3 in the list and swap if 
necessary. 


4 Continue to the end of the list. 
5 If there have been any swaps, repeat steps 1 to 4. 


Here is an example of a bubble sort in action. 


each index position. 


Pass 1 Pass 2 
Items 1 and 2 must | Items 1 and 2 are in 
] 3 1 
Bi be swapped. a Tt Q) | (@) i 3 % correct order. 


Items 1 and 2 are —~ TP a Items 2 and 3 ae 
] 
3 swapped. 2 2 l be swapped. 
bs = : _|.__. . 
1 4 


Items 2 and 3 are oa Items 2 and 3 have 
2 1@} () | eer) °) already in ascending tsa oa | : 3 ° } been swapped. 
order. =m) | 4) | 6) F Items 3 and 4 must 
5 Z 1%) Te | 3 | Items 3 and 4 must =! be swapped. 
| be swapped. —_—e- a I 
3 ; 61S 4 6 Items 3 and 4 have 
: ‘ 1 2 3 Items 3 and 4 have ) been swapped. 
____ been swapped. ‘ ‘ (@) ©) Items 4 and 5 do not 
5 4 F (6) a i Items 4and 5 must need to be swapped. 
now be swapped. 
‘ ¢ é E Items 4 and 5 have Pass 3 . 
been swapped. ‘a 6 Items 1 and 2 must 
Figure 1.11 A bubble sort > ———_-—__—— peskiganed: 
= 3 re OF 6 Items 1 and 2 have 
It would take a human three passes to carry out this | | | | es been swapped. 
bubble sort, but a computer would need four passes ; ‘ ; P " All items are now in 
because it must continue until there have been no swaps. the correct order. 


A computer cannot just look at all of the numbers at 
once and see that they are all in order. 


Do you know why it is called ‘bubble sort’? If you look carefully, you can 
see that gradually the largest items move to the end, like bubbles rising 
in water. After the first pass, the largest number is in its correct position. 
Then after the second pass, the next largest is in its correct position. 
This happens on each pass and so if the algorithm is to be made more 
efficient the last set of comparisons can be omitted. 


a4 


Start 


length = length of list 
position = 1 
switch = 0 


4 


Is List Swap List item 
item (position) > uc (position) and List 
List item (position item (position + 1) 

+1)? switch = switch + 1 


NO _ ¢ 
° 


position = position + 1 


The bubble sort algorithm can be represented as a flowchart as 
shown in Figure 1.12. 


Study the flowchart of the bubble sort algorithm. 


Using the variables declared, explain the logic behind the 
algorithm — explain how it functions to sort a list. 


The bubble sort algorithm can also be expressed in 
pseudo-code. It assumes that the items to be sorted are 
stored in an array. 


#The items to be sorted are in an array with the 
identifier ‘items’. 


~ SET length TO LENGTH(items) - 1 #The variable ‘length’ is set to the length of 
the array minus one to represent the last index 
NO Is position. 
- saya SET swapped TO 1 #The variable ‘swapped is assigned the value of 1. 
WHILE swapped = 1 DO #The loop will run while the value of ‘swapped’ is 
vies equal to 1. 
swapped = 0 #The variable ‘swapped’ is changed to 0. It will be 
ne : changed back to 1 if a swap occurs. 
° switch = 0? FOR index FROM 1 TO length DO #This will set up a loop from 1 to the value of 
‘length’ which is the length of the array minus one. 
VE IF items{index-1] > items[index] THEN #On the first loop this will compare the value of 
the item at index 1 with that at index 0. 
aad SET temp TO items[index-1] #This block of code will swap the two items if the 


first is larger than the second. 


SET items[index-1] TO items[ index] 


Figure 1.12 Bubble sort 
algorithm written as a 
flowchart 

END IF 


END FOR 


END WHILE 


SET items[index] TO temp 
SET swapped TO 1 


#Swapped is set to 1 to indicate that a swap has 
occurred and so the ‘WHILE’ loop will turn again. 


#This ends the ‘FOR’ loop and so the variable 
‘index’ will be incremented by 1 for the next 
iteration. When it is greater than the ‘length’ 
variable the ‘FOR’ loop will stop. 


A tutor has stored a set of class examination results in an array 
named ‘exam1’. Write an algorithm expressed in pseudo-code 
to sort the results into descending order and then output the 

| highest and lowest result. 
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Merge sort is a sorting algorithm that divides a list into two smaller lists 


and then divides these until the size of each list is one. Repeatedly applying Recursion: a process that 

a method to the results of a previous application of the method is called is repeated. For example, a 

recursion. document can be checked and 
edited, checked and edited and 

In computing a problem is solved by repeatedly solving smaller parts of | SO on until it is perfect. 


the problem. A part of a program can be run and rerun on the results of the "es 
previous run (e.g. repeatedly dividing a number by 2). 


Here is an example of a merge sort which will sort the following list into ascending order. 


EU) EM EY ah ala al | 


The list is recursively split into half to produce a left list and a right list each time. 


| || || 
a ee | ea 


This continues until there is only one item in each list. Therefore each list is sorted into order. 


The left and right lists are now recursively merged with the items in the correct order. 


The leftmost items in each list are the lowest items of those lists and the algorithm compares them — in this case 
4 with 2. The 2 is inserted in the new list and the 4 is then compared with the second number of the right list — 6. 
The 4 is inserted and the 6 is compared with the second number of the left list. 


Lo i | 


The algorithm now merges these two lists in the same way to produce the final sorted list. 
1 is compared with 2 and then 2 with 3, 3 with 4 etc. 


eye | fee) fe) bed be Le] Le] 


' Using a table like the one in the worked example, show how the following 
list would be sorted into descending order using merge sort. 


48 20, 9, 17, 13, 21, 28, 60 


Efficiency of sorting algorithms 
This graph compares the performance of the bubble and merge sort algorithms. 


Brute force: an algorithm 
design that does not include 
any techniques to improve 

| performance, but instead relies | 
on sheer computing power to try 
all possibilities until the solution 
to a problem is found. 


Divide and conquer: an 
' algorithm design that works by 
dividing a problem into smaller 
and smaller sub-problems, 
until they are easy to solve. 
The solutions to these are then 0 100 1000 15000 250000 500000 
combined to give a solution to Number of items 
the complete problem. 


Time (seconds) 
WwW 


Figure 1.14 A graph comparing the performance of bubble and merge sort algorithms 


The bubble and merge sort algorithms demonstrate two alternative 


Extend your knowledge [fy Mim delkecleSacek ioldtlkel iat 


Only two sorting algorithms are The bubble sort algorithm is said to be using brute force because it starts 
required for the specification: at the beginning and completes the same task over and over again until it 
bubble sort (the slowest) and has found a solution. 


merge sort (one of the most 
efficient). There are far more, and 
many of them are relatively easy 
to code. Research the insertion 
and selection sorts. 


The merge sort uses the ‘divide and conquer’ method because it 
repeatedly breaks down the problem into smaller sub-problems, solves 
those and then combines the solutions. 


The graph shows that a bubble sort is far slower at sorting lists of more 
than 1000 items, but for smaller lists the time difference is negligible. 


cLinearsearch As the bubble sort algorithm is easier to code it could be advantageous to 
use it for smaller lists of less than 1000 items. 


1 Start at the first item in the 


I Searching algorithms 
2 Compare the item with the To find a specific item in a list involves carrying out a search. Like sorting, 
search item. 


some methods of searching are more efficient than others. 
3 If they are the same then 


stop. Linear search 
4 \f they are not then move to A linear search is a simple algorithm and it is not subtle. It simply starts at 
the next item. the beginning of the list and goes through it, item by item, until it finds the 
5 Repeat 2 to 4 until the end of item it is looking for or reaches the end of the list without finding it. 


the list is reached. 
A linear search is sequential as it moves through the list item by item. 
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Activity 25 ? 
An array contains the names 

of the one hundred most 
downloaded performers on an 
online music streaming site. 


Produce an algorithm expressed 
: in pseudo-code that enables 

, a user to see if their favourite 

| performer is in the top 100. 


This flowchart displays a linear search algorithm, but some of the symbols 
| have not been labelled. Draw and complete the flowchart using the 
' labels provided. 


Start 


length = length of array 


— 


: 


target = search item 


—— II ee 


' 


Wes 
: 


Output array 
[index] 


YES 
End : Te 


End 


Labels 


Output search item ‘| 
not found 


Does array [index] = 


Is index > length? 
9 target? 


index = index + 1 Input search item 


| Figure 1.15 A flowchart displaying a linear search algorithm with labels missing 


Binary search 
Like a merge sort, a binary search uses a ‘divide and conquer’ method. 


Median: the middle number 

when the numbers fe BUC a S 
ascending or descending order 

(e.g. if there are 13 numbers, then | You have probably used a binary search method when trying to guess a 

the 7thnumber is the median). number between two limits. If you are asked to guess a number between 1 
and 20 you will probably start at 10, the middle number. If you are told this 


is too high, you will then guess 5, the middle number between 1 and 10, 
and then repeat this method until you find the correct one. 


If there are an even number 
of items in an ascending list, 
choose the item to the right of | 
the middle (e.g. if there are 10 
numbers, then choose the 6th as 
the median). 


In a binary search the middle or median item in a list is repeatedly selected 
| to reduce the size of the list to be searched — another example of recursion. 
If the selected item is too high or too low then the items below or above 

that selected item can be searched. 


To use this method the list must be sorted into ascending or descending 
order. It will not work on an unsorted list. 


Select the median item of the list. 
If the median item is equal to the search item then stop. 
lf the median is too high then repeat 1 and 2 with the sub-list to the left. 


If the median is too low then repeat 1 and 2 with the sub-list to 
the right. 


5 Repeat steps 3 and 4 until the item has been found or all of the items 
have been checked. 


In this list, the search item is the number 13. 


RR WN 


Select the 
3 13 24 27 (i) 39 45 60 69 median 
number. 


As this is too high, the sub-list to the left of the median must be searched. 


The median number 


3 13 27 of this sub-list is 


now selected. 


This is again too high and so the sub-list to the left must be searched. 


3 (3) The median number is now the 
search item. 


Figure 1.16 Binary search including sub-lists 


In this example, it took three attempts to find the search item. A linear 
search would have accomplished this with only two attempts. 
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| Display the stages of a binary search, as in the worked example above, to 
find the number 13 in this list. 


3913 15 21 24 27 30 36 39 42 54 69 


Here is part of the algorithm displayed as pseudo-code. Some of the lines 
are incomplete as indicated by ‘x’ symbols. 


#The items to be sorted are in an array with the 


eA RXR RMR KXKXN NX KKXRX RMX XXX identifier ‘aList’ containing the values: 3, 13, 24, 27, 31, 
39, 45, 60, 69. 

SET start TO 0 #The variable ‘start’ is set to 0 — the index of the first 
item in the list. 

SET end TO LENGTH(aList) — 1 #The variable ‘end’ is set to the index of the last item in 
the list. 

SET found TO False #The Boolean variable ‘found’ is assigned the value ‘false’. 

SEND ‘Please enter the search item.’ TO DISPLAY 

RECEIVE target FROM KEYBOARD #The user is asked to enter the search item. 

WHILE start <= end AND found = False DO #A while loop is set up with these conditions as the loop 


should stop either when the target is found or all of the 
list has been searched, i.e. when the ‘start’ and ‘end’ 
values are the same. 


SET middle TO (start + end) DIV 2 #The median is found and is assigned to the variable 
‘middle’. 
IF aList[middle] = target THEN #If the target is found, the user is informed and ‘found’ 


is set to ‘true’ to stop the WHILE loop. 
found = True 
Senp target + ‘ is in the list” TO DISPLAY 
END: IF 
IF target < aList[middle] THEN 
end = XXXXXXXXXXKXXXXXXXXXXXXKXXX 
BSE 
start = XXXXXXXXXXXXX 
END IF 
END WHILE 
IF found = False THEN 
SEND (‘The search item is not in the list.’) TO DISPLAY 


END IF 


cae 


Complete the missing sections of the pseudo-code. 
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Efficiency of searching algorithms 

In the example on page 36, the linear search was more efficient because it only 
had to carry out two comparisons instead of the three for binary search. But is 
this always the case? 


Searching algorithms can be compared by looking at the ‘worst case’ and the 
‘best case’ for each one. 


If you wanted to find a particular item in a list of 1000 items these are the best 
and worst case scenarios for the linear search and binary search algorithms. 


Linear search 

A linear search starts at the first item and then works through sequentially. 
The best case would be if the item is first in the list. 

The worst case would be if it is the last in the list. 

Therefore in this example the average would be 500 comparisons. 


Binary search 


The best case would be if the item is in the median position in the list. The 
search would require only one comparison. 


For the worst case it would have to choose the following medians until it 
finally hit the target. 


(This assumes that the target is always smaller than the median.) 
Attempt Median 
1 500 


KON CO ONS ND 
— 
ON 


10 il 
Therefore the worst case for the binary search is ten comparisons. 


The binary search is therefore far more efficient than the linear search. 


So should a binary search be used every time? That depends on the 
circumstances. The binary search has one great disadvantage — the list must 
be already sorted into ascending or descending order and therefore a sorting 
algorithm must be applied before the search. 


If the list is to be searched just once then a linear search would be better, but 
if there is a large list that will be searched many times then sorting the list and 
using binary search would be better. Once the list has been sorted new items 
can be inserted into the correct places. 
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(eeimeiisguestion Te 
A tutor has stored learner surnames in an array as shown below. 
Marek Jackson Bachchan : Wilson | Abraham i French Smith 
1 Show the stages of a bubble sort when applied to this data. 
The tutor has a sorted list of names from another class as shown below. 


Azikiwe . Bloom Byrne | Davidson | Gateri , Hinton _| Jackson | Linton = Smith Wall 


2 Show the stages of a binary search to find the name ‘Jackson’ when applied to this list. 


These questions are testing knowledge of the sort and search algorithms. 


1 The answer should be set out to show how the data is progressively 
sorted using the bubble sort and the result of each pass should be 
shown. 


2 This question is to check that you know that this is a recursive method 
where the median Is repeatedly selected. 


e An array stores multiple items of data. 


¢ There are many algorithms for sorting and searching data. 
¢ The choice of algorithm depends on the data that is to be processed. 


S1 Describe the differences between the ‘bubble sort’ and ‘merge sort’ algorithms. 
52 Describe how a binary search algorithm finds the search item. 


Challenge 
C1 Explain when a linear search might be preferable to a binary search even though the binary search algorithm 


is more efficient. 
C2 Discuss the advantages of using arrays in algorithms. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try the following activities again. 


e For S1 have a look at pages 30-33. 
e For S2 have a look at pages 36-37. 
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1.2 Decomposition and abstraction 


Learning outcomes a» 


By the end of this section you should be able to: 

* analyse a problem by investigating requirements (inputs, outputs, processing, initialisation) and design a 
solution. 

¢ decompose a problem into smaller sub-problems. 

° explain how abstraction can be used effectively to model aspects of the real world. 

* program abstractions of real-world examples. 


Problem solving 


The tasks of a computer scientist include defining and analysing problems; 
creating structured solutions — algorithms; coding the solutions into a form 
that can be implemented by a computer. 


Computational thinking: the 
thought processes involved in 
formulating problems and their 


solutions so that the solutions These tasks are part of what is known as computational thinking. 

are represented in a form that One of the skills required for computational thinking is algorithm design, 
can be effectively carried out bya = which we’ve covered in detail in this chapter. If there is a fault in the 
computer. algorithm design then the program will not work, however good a coder you 
Decomposition: breaking a are. Two other skills are decomposition and abstraction. 


problem down into smaller, more ! 


Decomposition 
manageable parts, which are ; 
onmeeertocolve. Decomposition is usually the first step in the problem-solving process. 


Once a problem has been broken down and the sub-problems have been 


pe magiion. Te plow se! identified, algorithms to solve each of them can be developed. 


removing or hiding unnecessary 

detail so that only the important | Decomposition means that sub-problems can be worked on by different 

points remain. , teams at the same time. As smaller algorithms are developed for each sub- 

—— problem, it is easier to spot and correct errors and when the algorithm is 
developed into a program, code can be reused. 


A learner has been set the task of creating a computer version of the 
‘noughts and crosses’ game where a user plays against the computer. 


he - a -. 
Design of How to keep ] How the How the 
interface , trackofwhich — ‘computer’ will ‘computer’ will 
showing the squareshave decide which = | decide when the 
3x3 grid. | | been selected square to select. | | gameis over and 
by’X’and‘O’and_ , , who has won. 


which are free. 


Figure 1.17 Sub-problems to be solved to create a noughts and crosses 
computer program 


The diagram shows some of the sub-problems that must be solved in 
order to solve the complete problem and create a version of the game. 
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Abstraction 


We use abstraction all the time in our daily lives. We abstract the essential 


features of something so that we can understand what people are trying 
to communicate. 


Somebody might say, ‘l was walking down the street 
when | saw a cat.’ You immediately understand what 
they mean by ‘street’ — probably a road with a pavement 
and houses or shops along the side of it. Similarly you 
can picture the cat - a smallish animal with fur, four legs 
and a tail. An animal that is basically ‘cattish’. You have 
extracted the basic properties of animals called cats so 
that you can recognise one when you see one or imagine 
one when somebody talks about a cat. 


What you picture is very unlikely to be exactly like the 
actual street and cat that the person experienced. But 
because of our ability to abstract, the person did not have 
to go into unnecessary painstaking detail about exactly 
where they were and what they saw. They wouldn't get 
very far with the story if they did. 


When we create algorithms we abstract the basic details of the problem and 
represent them in a way that a computer is able to process. 


Yasmin is designing a computer version of a game in which users have to 
throw a die to determine their number of moves. 

In the computer game the users can't have an actual die, so she will have 
to design a ‘pretend’ or virtual die that behaves in exactly the same way 
as a real-life die. 

Yasmin will have to use her powers of abstraction to work out the 
essential features of a die and then represent them in computer code. 
To represent the die she will have to create a routine that will select a 
random number from 1 to 6 because that’s what a die does. 


Yasmin has used abstraction to model a real-life event. 


Levels of abstraction 

There are different levels or types of abstraction. The higher the level of 
abstraction, the less is the detail that is required. We use abstraction all the 
time in accomplishing everyday tasks. 


When programmers write the ‘print’ command they do not have to bother 
about all of the details of how this will be accomplished. They are removed 
from them. They are at a certain level of abstraction. 


A driver turning the ignition key to start a car does not have to understand 
how the engine works or how the spark to ignite the petrol is generated. It 
just happens and they can simply drive the car. That is abstraction. 
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The diagram on page 40 showed some of the sub-problems that the 

problem of creating a noughts and crosses game could be divided into. 

The following could be written at a high level of abstraction. 

* The computer goes first. Then the user. This continues until either one 
wins or all of the squares have been used. 


Immediately a pattern can be recognised ~ a loop will be needed. 


Inputs and outputs 

The following inputs from the user will be needed. 

e Start the game. 

e Entries for the user. 

¢ Select a new game or finish. 

The following outputs will be needed. 

¢ A message to inform the user when it is their turn. 

¢ Amessage to inform the user if they try to select a square that has 
already been used. 

¢ A message to inform the user if the game is a draw. 

¢ A message to inform the user if they or the computer has won. 

¢ A message to ask the user if they want to play another game or want 
to finish. 

Processing and initialisation 

The following processing will be needed. 

e Set up the grid with the nine squares. 

¢ Initialise all variables to a start value. 

¢ Decide which square the computer will select. 

¢ Allow the user to select a square. 

¢ Check if the user has selected an already used square. 

¢ Check if the computer or the user has won. 

¢ Check if all squares have been used and the game is a draw. 

¢ Allow the user to select a new game or finish. 


The solution is still at a high level of abstraction and more details will need 
tobe added. 


For example, the programmer will need to decide how the game will record 
which player has selected each square; how the computer will decide which 
square to select; how the game will decide if the computer or the user has won. 


The programmer will have to go into more and more detail or move to lower 
levels of abstraction. 


Eventually the programmer will be able to design an algorithm for the game 
and code it using a high-level programming language such as Python or 
Java. Even before they start to implement the game, they will need to plan 
how they will test the finished program to make sure that it works correctly; 
what test data they will use; what outcomes it should produce. 
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Coding an algorithm 


High-level programming languages make it easier for a programmer to write 
code. Unfortunately, the processor, which has to execute the program, cannot 
understand the language it is written in, so it needs a translator to translate 
the code into the only language it does understand - a stream of 1s and Os. 


These high-level languages are therefore at a high level of abstraction - very 
far removed from the actual language of a computer. 


The processing can be split into parts. For example, in the example of the 
noughts and crosses game there could be separate algorithms for: 


* deciding where the computer should make its next selection — it could be 
called ‘computer entry’; 


¢ checking if the computer or the player has won - it could be called ‘check 
if won’; 


¢ checking if there are any empty squares left — it could be called ‘check draw’. 


These separate algorithms could be used when they are needed. It is 
efficient because it means that the same code doesn’t have to be rewritten 
whenever it is needed. 


These items of code are called subprograms. 


In Chapter 2 we'll look in detail at how subprograms are used to reduce the 
complexity of programs and to make them easier to understand. 


In the die example above, the designer could write a subprogram called 
‘die’ that generates a random number from 1 to 6. In the main program the 
designer could just call the ‘die’ subprogram without having to think about 
how to implement it each time. 


Activity 28 ? 
In a game each player spins a wheel that is divided into four colours: 

red, blue, green and yellow. Each player has to answer a question on a 
particular topic depending on the colour next to a pointer when the wheel 
stops. Red is for science, blue for history, green for general knowledge 
and yellow for geography. A player scores two points if they answer 
correctly on the first attempt and one point for being correct on the 
second attempt. The first player to reach 30 points is the winner. 

Your task is to design a computer version of the game for up to four 
players. You must analyse the problem and list all of the requirements; 
decompose the problem, list all the sub-problems and write a brief 
description of each; list all of the input, output and processing 

/ requirements. 

| One of the requirements that will have to be modelled is the spinning of 

| 


the wheel. Using a written description and pseudo-code show how this 
could be done. 
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Subprogram: a self-contained 
module of code that performs a 
specific task. It can be ‘called’ by 
the main program when it 

is needed. 


Computer models or simulations 
of real life are widely used. It is 
far cheaper and safer to train 
pilots on flight simulators than 
on real aircraft. They are also 
used in weather forecasting, 
designing and testing new cars 
and bridges and even teaching 
people to drive. A computer 
model is used by the Chancellor 
of the Exchequer to predict what 
will happen if changes are made 
in the Budget (e.g. if taxes are 
raised or lowered). 


Extend your knowledge » 


Complete the noughts and crosses game by coding it in the language you 
are studying. See if you can end up with a working game. 


* Computational thinking is an approach to solving problems that includes techniques such as decomposition 
and abstraction. 


* Problems are easier to solve if they are decomposed into smaller 
sub-problems. 


¢ Abstraction is used to remove unnecessary detail to make a problem easier to understand and solve. 


¢ When designing a solution to a problem the inputs, outputs and processing requirements should be identified 
at the outset. 


oy 7 


Strengthen 
$1 Explain what is meant by ‘decomposition’ and the benefits it provides for programmers. 

S2 Explain what is meant by ‘abstraction’. 

Challenge ‘ 


C1 Describe examples of the use of ‘decomposition’ and ‘abstraction’ when solving a problem. 
C2 In your own words explain what is meant by ‘computational thinking’. 


How confident do you feel about your answers to these questions? 
If you're not sure you answered them well, reread pages 40-42. 
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Programming 


2.1 Develop code 


Learning outcomes hh 


By the end of this section you should be able to: 

* explain the difference between algorithms and programs. 

* code an algorithm in a high-level programming language. 

¢ describe the characteristics of data types and select appropriate data types for variables. 


* use sequence, selection and iteration in your programs. 


Algorithms and programs 
As you learnt in Chapter 1, an algorithm is a precise method of solving a 


‘Keytetine——— aan’ problem. It consists of a sequence of unambiguous, step-by-step instructions. 
| Aprogram is an algorithm that has been converted into program code so 


Execution: the process by that it can be executed by a computer. A well-written algorithm should be 
which a computer carries out | — free of logical errors and easy to code in any high-level language. 
the instructions of a computer 


| As part of this course you will learn to write programs in a high-level 

| programming language. All high-level programming languages resemble 
natural human languages, which makes them easier for humans to read 
and write but impossible for computers to understand without the help 
of a translator. You will learn more about how a program written ina 
high-level language is translated into machine code - the language of 
computers — in Chapter 4. The aim of this chapter is to develop your 
generic programming skills rather than to teach you how to program in 
a particular language. 


program. 


A computer programmer at work writing code 
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In Chapter 1 you were introduced to an algorithm for keeping track of a sport 
club's wins, draws and losses (see pages 20-21). Here is what it looks like coded 
in the Python programming language. 


SeT totalFor TO 0 totalFor = 0 
SET totalAgainst TO 0 totalAgainst = 0 
Saf win TO 0 win = 0 
SET loss TO 0 ‘loss = 0 
SET draw TO 0 draw = 0 
SET anotherEntry TO ‘Y’ anotherEntry = ‘Y’ 
WHILE anotherEntry = ‘Y’ DO while anotherEntry == ‘Y’: 
SEND ‘Enter goals for:’ TO DISPLAY - 
RECEIVE goalsFor FROM KEYBOARD goalsFor = int(input(‘Enter goals for:’)) 
SEND ‘Enter goals against:’ TO DISPLAY 
___ RECEIVE goalsAgainst FROM KEYBOARD goalsAgainst = int(input(‘Enter goals against:’) 
SET totalFor TO totalFor oe goalsFOR totalFor = totalFor + goalsFor 
SET totalAgainst TO totalAgainst + goalsAgainst totalAgainst = totalAgainst + goalsAgainst 
IF goalsFor > goalsAgainst THEN if goalsFor > goalsAgainst: 
SET win TO win + 1 win = win + 1 
ELSE | 
IF goalsFOR < goalsAgainst THEN elif goalsFor < goalsAgainst: 
SilelossetOcliess, + 1 loss = loss + 1 
ELSE 
else: 
_ SET draw TO draw + 1 draw = draw + 1 
END IF 
END IF ———— | 
"SEND ‘Press ry! to enter another result.’ TO DISPLAY 
RECEIVE anotherEntry from KEYBOARD anotherEntry = input(’Press ‘Y’ to enter another result”) 
END WHILE . 
“SEND otal wins: ’ & win TO DISPLAY print(’Total wins:’,win) 
SEND ‘Total losses: ‘ & loss TO DISPLAY | print(‘Total losses:',loss) 
“SEND ‘otal draws: ‘ & draw TO DISPLAY print(‘Total draws:',draw) 


As you can see, the algorithm and the program are similar, but not the same. 
Some obvious differences are: 


¢ Command words in pseudo-code are written in capital letters. In Python they 
are in lower case. 

The SET ... TO pseudo-code command is replaced in Python by an equals sign. 
The RECEIVE ... FROM pseudo-code command becomes ‘input’ in Python. 
The SEND ... TO DISPLAY pseudo-code command translates into ‘print’. 
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Syntax error: an error that 
occurs when a rule of the 


| 


programming language is broken. | 


Activity 1 


Start compiling a reference 

, guide that shows the high-level 

program code equivalents to 

| the pseudo-code commands 

you use. Begin by revisiting 

i the algorithms you wrote while 
studying Chapter 1. 


? 


¢ The equivalent of IF ... THEN in pseudo-code is ‘if’ on its own with a colon 
at the end of the line. 


* In fact, there are quite a number of colons in the Python code and none in 
the pseudo-code. 

¢ There are no END statements in Python. Instead, indentation is used to 
indicate the start and end of a selection statement or a loop (see page 55). 


¢ In Python the symbol for the relational operator equals is ‘==’. This 
enables the computer to tell the difference between the arithmetic 
operator ‘=’ and the relational operator. 


The two also have features in common: 


¢ Both use descriptive variable names. 
¢ Indentation features in both. 


¢ The three programming constructs — sequence, selection and iteration - 
are the basic building blocks of both. 


What you can’t see from just looking at the code is that the rules of the 
programming language, its syntax, are much more important in a program 
than they are in an algorithm. 


You can get away with missing out a bracket, getting the indentation 

wrong or misspelling a command word in pseudo-code because the target 
audience is ahuman and humans are able to make allowances for any 
shortcomings in the code. The same can't be said for computers. If the code 
isn't exactly right, the computer won't be able to execute it and will flag up 
a syntax error. Needless to say, you are bound to encounter syntax errors 
frequently while learning to program. 


Not all programming languages use the same symbols to represent 
arithmetic operators. Here are the symbols Python and Java use to 
represent modulo (MOD), integer division (DIV) and exponent (4): 


Python Java 
Modulo % % 
_ Integer Division // Math.floor(a / b) 
Exponent ae Math.pow(a, b) 


You need to check which symbols the language you are learning uses for 
arithmetic operators. 


Data types 

As you learnt in Chapter 1, algorithms use variables (named memory 
locations) to store values. Variables havea variety of uses, for example 
controlling the number of times a loop is executed, determining which branch 
of an IF statement is taken, keeping running totals and holding user input. 
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When algorithms are converted into programs the computer needs to 
be told what type of data is stored in each variable. Every programming 
language has a number of built-in data types. 


integer Used to store whole numbers | 30 age = 30 
without a fractional part. | numbers 
real Used to store numbers witha | 25.5 weight = 25.5 
fractional part (decimal place). price = 12.55 
Real numbers are sometimes 
referred to as floats (short for 
floating point). ; | 
Boolean Only has two possible values: False | correct = False 
True or False. lightOn = True 
character Used to store a single character ‘m’ gender =‘m’ 
or symbol. char =“ 


Table 2.1 Common data types 


When writing pseudo-code you don't have to specify the data types of 
variables. However, data types become much more important once you start 
programming in a high-level language because the data type of a variable 
determines the operations that can be performed on it. 


For example, the result of multiplying a value by 5 differs according to its 
data type. 


integer 8*5=40 
real S07 5— 400 
character 8 >=1868e8. 


The method of declaring variables differs between programming languages. 
Some languages, such as Python, automatically select the appropriate data 
type for a variable based on the data assigned to it. Others, such as Java, require 
the data type of variables to be declared before the variables can be used. 


Although you don’t have to declare the data types of variables in pseudo- | 


code, it is a good idea to do so. You can simply list the variables and their | 


data types at the start of an algorithm, for example: | 


INTEGER age 

REAL weight 
BOOLEAN correct 
CHARACTER gender 
You can also specify the data type of a variable in the RECEIVE statement, 
for example: 

RECEIVE age FROM (INTEGER) KEYBOARD 

mae price FROM (REAL) KEYBOARD 


Data type: specifies what kind of 
data it can hold. Common data 
types are integer, real, Boolean 
and character. The data type of a 
value determines the operations 
that can be performed upon it. 


Programming 


Initialisation: the process of 
assigning an initial value to a 
variable. 


Assignment statement: the 
- SET...TO command is used to 

initialise variables in pseudo- 

code, for example: 

SET anotherGo TO 0 

SET correct TO False 


Type coercion: the process of 
converting the value stored ina 
variable from one data type to 
another. 


Investigate how type coercion 
is handled in the high-level 
| language you are studying. 


Variable initialisation 

When a variable is declared, the computer allocates it a location in memory. 
Initially this location is empty, so before a variable can be used it has to be 
given a value. 


You can put an initial value into a variable by 


¢ initialising it when the program is run (e.g. SET total TO 0); 


¢ reading a value from a keyboard or other device (e.g. RECEIVE 
admissionCharge FROM (INTEGER) KEYBOARD). 


Once a variable has been initialised an assignment statement is used to 
change its value (e.g. SET total TO total + admissionCharge). 


lf a variable, such as a loop counter, is intended to hold a running total, 
then it should always be initialised to a starting value. Some programming 
languages won't execute if the programmer fails to do this; others will do so 
but may well produce some unexpected results. 


Investigate what data types are available in the high-level language 
you are studying. Produce a table similar to the one above to 
summarise your findings. 

2 Identify an appropriate data type for each of these items. 
a_ The test score of an individual learner. 
b The average score for a group of learners. 
c Whether or not the pass mark for the test has been achieved. 

3 Look back over the algorithms you wrote in Chapter 1 and identify 
instances of variable initialisation. | 


Type coercion 

Sometimes the data type of a variable gets changed during program execution. 
This is known as type coercion. For example, if an integer value and a real 
value are used in an arithmetic operation, the result will always be a real. 


Languages such as Python, where the data type is assumed from the context, 
have commands that allow the data type to be changed. 


A theme park uses a program to monitor the number of people entering 
and exiting the park. The maximum number of visitors at any one time 
must not exceed 10,000. When the number of people in the park reaches 
the maximum a ‘Park Full’ message is displayed at the entrance gate. 
Children can visit the park free of charge, but each accompanying adult 
must pay £2.50 admission. The program keeps track of the amount of 
money collected at the gate. 


1 Identify the variables needed in the program. 
| 2 Select an appropriate data type for each variable and constant. 
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Command sequence, selection and iteration 

In Chapter 1 you learnt that the three key building blocks of algorithms 
are command sequence, selection and iteration. In this chapter you will 
have the opportunity to implement these constructs in the high-level 
programming language you are studying. 


Command sequence 


A command sequence is a set of instructions that the computer executes 
one after another in order. Command sequences are usually combined 
with loops and selection statements. 


| Activity 5 


RECEIVE numberl FROM (INTEGER) KEYBOARD 
RECEIVE number2 FROM (INTEGER) KEYBOARD 
SET resultl TO numberl / number2 

SEND resultl TO DISPLAY 

SET result2 TO numberl MOD number2 

SEND result2 TO DISPLAY 

SET result3 TO numberl DIV number2 
SEND result3 TO DISPLAY 


1 Describe what this algorithm does. 
2 State the output of the algorithm given the following inputs: 


arc 

b 10,3 
| ¢ 20,6 
' 


3 Implement this algorithm in the high-level language you are studying. 


Selection 
The selection construct is used to create a branch in a program. The 
computer selects which branch to follow based on the outcome of a 
condition, for example: 
IF day = ‘Saturday’ OR day = ‘Sunday’ THEN 
eer alarm TO il 
ELSE 
SET alarm TO 8 


A standard IF... THEN...ELSE statement provides two alternatives. If there 
are more than two, then a nested IF must be used. However, many high-level 
programming languages have an additional built-in selection construct that 
does away with the need for a nested IF statement. 


YY. 


: Activity 6 ? 


1 Investigate the selection 
statements that are 
available in the high-level 
programming language you 
are studying. 

2 In Chapter 1, Activity 8 (on 
page 11) you created an 
algorithm that inputs a 

learner's age and calculates 

how many driving lessons they 
will require according to a set 
of rules. 

Implement this algorithm in 

the high-level programming 

language you are studying. 


3 In Chapter 1, Activity 10 (on 
page 13) you studied an 
algorithm for calculating the 
grade achieved in an end-of- 
topic test. 

Implement this algorithm in 
the high-level language you 
are studying. 


CC a 
It is sometimes necessary to 
place one loop statement within 


another loop statement. This is 
| called a nested loop. 


Many programming languages allow you to test multiple conditions 
within the same IF statement to save you having to use nested IFs. 
Here’s how it’s done in Python and Java. 


Python Java 


if numl > num2: 


1 eum) > a2 et 


print (num1 ) System.out.printin(numl1) ; 


elif numl < num2: } else if (numl < num2) { 


print (numz2 ) System.out.println(num2); 


else: } else { 


print( ‘Equal’ ) SYStem.out.princin( “Equal”; 


Loops 

A loop is another name for an iteration. Loops are used to make a 
computer repeat a set of instructions more than once. There are two 
types of loop — definite and indefinite. 


A definite loop is used when you know in advance how often the 
instructions in the body of the loop are to be repeated. For example, if 
you want the computer to display a character on the screen for a fixed 
amount of time and then remove it. 


An indefinite loop is used when the number of times a loop will need 
to be repeated is not known in advance. For example, if you want 

to give a user the option of playing a game as often as they want. 
Indefinite loops are repeated until a specified condition is reached. 


Every programming language has a number of built-in loop 
constructs. You will need to explore the ones provided in the language 
you are studying. 


' Here’s how to write a WHILE loop in Python and Java. 


Python 


counter = 0 


Java 
int counter =10; 


while counter < 10: while(counter < 10) { 


print(counter) System.out.println(counter); 
counter = counter + 1 COuUnEeET tT 
} 


1OP Tip 


1 Describe what these two algorithms do and implement them in the Be careful not to create an infinite 


| 
' : 
high-level programming language you are studying. | loop — one that will never end. 
a Algorithm A _ Always ensure that the condition 


_ to terminate the loop will 


FOR index FROM 1 TO 10 DO eventually be met 


SEND index * index * index TO DISPLAY 
END FOR 


b Algorithm B 


SET counter TO 10 
WHILE counter > 0 DO 
SEND counter TO DISPLAY 
SET counter TO counter - 1 


END WHILE nig 
Activity 8 ? 


2 in Chapter 1, Activity 11 (on page 14) you created two versions of an 
algorithm for a dice game: one used a WHILE...DO loop and the other | 
a REPEAT...UNTIL loop. 
Implement both versions in the high-level programming language 
you are studying. 


3 In Chapter 1, Activity 12 (on page 15) you created an algorithm that 


In Chapter 1, Activity 17 (on page | 
24) you developed an algorithm 
| using a nested loop for part of a 
simulation of a payment system 
| at a car park. 


sums all the numbers in a range. Implement this algorithm in 
Implement this algorithm in the high-level programming language the high-level programming 
you are studying. language you are studying. 


Summary > 


¢ A program is an algorithm that has been converted into program code. 

¢ Pseudo-code is far more forgiving than program code. 

e The four basic data types are integer, float/real, Boolean and character. 

e The data type of a variable determines the operations that can be performed on it. 
e Data types don’t have to be declared in pseudo-code but it’s a good idea to do so. 


¢ Variable and type declarations, command sequences, selection and iteration are four of the structural 
components of a program. 


$1 Explain why variables are needed. 
52 Give examples of the four data types. 
S3 Describe how selection and iteration are implemented in the high-level language you are studying. 


C1 Describe these structural components of a program: variable and type declarations, command sequences, 
selection and iteration constructs. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try redoing the activities in this section. 
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2.2 Making programs easy to read 


Learning outcomes bh 


By the end of this section you should be able to: 


¢ explain the benefit of producing programs that are easy to read. 
* use techniques to improve the readability of code and describe how code works. 


Code readability 


You should always try to ensure that any code you write is easy to read and 
understand. This benefits you and anyone else who needs to understand 
how your programs work. 


It’s surprising how quickly you forget. Try revisiting the programs you have 
already written in this chapter and make sure it is still clear to you what they 
do and how they work. Imagine how much more difficult it would be to make 
sense of a complex program with lots of variables, subprograms, nested 
loops and multiple selection statements. 


Contrary to what you might think, programming is not a solitary activity. 
Programmers usually work in teams, with each programmer developing a 
different part of the program. This only works if they all adopt a consistent 
approach to writing readable code. 


Le Python 3.4.1: example2.py - C:/Python/example2.py - Oo x 
File Edit Format Run Options Windows Help 

fimpert random - + 
game=T1iue t 
|while game==True: 


Numi=int (input ("Choose one of the following: 1 Rock, 2 Paper, or 3 Scissors:”) 
Num2=random. randint (1, 3) 
if Num2==1: 
print ("Computer chooses Ruck") 
elit Num2==2: 
print ("Computer chooses Paper”) 
eles: 
print ("Computer chooses Scissors") 
if Num1l==Num2: 
print("It's a draw”) 
elif Numl==1 ani Num2==3: 
print ("You win!) 
@lit Numl==2 an! Num2==1: 
print ("You win!) 
elif Numl==3 .1n1 Num2==2: 
print ("You win!") 
Css 
print("Computer wins") 
text=linput ("Play again? Y/N:") 
Vitec lm yt. 
game=tralre 


|. 


Ln: 26 Cok 


° 


Figure 2.1 An example of Python code 


—— tt Making programs easy to read 


The programmer who produced this program did not follow good practice. 
There are a number of ways that the readability of this code could be improved. 


¢ Use descriptive names for variables (e.g. userChoice instead of Num1). 
¢ Add blank lines between different blocks to make them stand out. 
¢ Add comments that explain what each part of the code does. 


- 


Programmers often work in teams; it is vital that any coding they share is clear and 
error free 


This table lists the techniques you should use to make your programs easy 
to read and understand. 


Technique —sC@esscrription 

Comments Comments should be used to explain what each part 
of the program does. 

Descriptive names Using descriptive identifiers for variables, constants 
and subprograms helps to make their purpose clear. 

Indentation Indentation makes it easier to see where each block 
of code starts and finishes. Getting the indentation 
wrong can result in the program not running or not 


producing the expected outcomes. Investigate how to write 
White space Adding blank lines between different blocks of code | comments in the high-level 
makes them stand out. language you are studying. 


Table 2.2 Techniques for program clarity 


Top tip. & Did you know? 
Here’s how to write comments in Python and Java. Microsoft® Windows® uses 
° Python: start each line of a comment with the # symbol. roughly 40 million lines of code 


~ code readability obviously 
becomes more and more 
important as programs get larger. 


¢ Java: start single line comment with // or start a multi-line comment 
with /* and end it with */ 
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Programming 


‘Activity 10 | 
1 Rewrite this algorithm using appropriate techniques to make it easier 
to read. 


SED, x70. 10 

WHILE x >= 0 DO 

tix. -0 

SEND x TO DISPLAY 

ELSE 

SEND ‘Blast Off’ TO DISPLAY 
END IF 

SET x TO % =i 

END WHILE 


Calculators have a number of useful 
functions 


2 Implement the improved algorithm in the high-level language you are 
studying. 

3. Design an algorithm for a simple calculator that: 
a allows the user to choose from these options: addition, 

subtraction, division and multiplication; 

b prompts the user to input two numbers; 
c performs the calculation and displays the result; 
d_ offers the user the option of performing another calculation. 

4 Implement this algorithm in the high-level programming language 
you are studying, ensuring you use techniques to make the program 
easy to read. 


XtraSave is a chain of supermarkets that has recently installed self- 
checkouts in all its stores. 

Customers are issued with one 10p voucher for each £10 they spend and 
one 5p voucher if they spend £5. 


The self-checkout system uses the following algorithm to print vouchers. 


01 RECEIVE amountSpent FROM (INTEGER) KEYBOARD 
02 WHILE amountSpent >= 5 DO 
03 IF amountSpent >= 10 THEN 


04 SEND voucherTen TO PRINTER 

oS SET amountSpent TO amountSpent - 10 

06 ELSE 

07 IF amountSpent >= 5 THEN 

08 SEND voucherFive TO PRINTER 

09 SET amountSpent TO amountSpent - 5 
10 END IF 

eh END IF 


Self-checkout machines are a common 
sight in supermarkets 


12 END WHILE 
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1 Identify the line number(s) that show one example of each of these 
structural components: (3 marks) 


This question is testing your 


a Selection ability to identify two of the main 
b= |teration programming constructs. Make 
c Variable initialisation sure you have identified the line 

2 Identify two techniques the programmer has used to improve the ‘Came oe ee 4 d 
readability of the code. (2 marks) and iteration constructs are found. 


3 Give two reasons for writing code that is easy to read. (2 marks) 


e Using comments, descriptive names, indentation and white space makes code easier to read. 
e Producing readable code makes it easier to understand what a program does and how it does it. 


$1 Explain why it is important to make your code easy to read. 
S2 Describe four techniques that a programmer should use to make code easy to read. 


C1 Revisit the programs you have already written. Do you still understand what they do and how they work? If 
not, try to improve their readability. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, look again at the table on page 55. 
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2.3 Strings 


Learning outcomes > 


By the end of this section you should be able to: 
* describe what a string is and explain what strings are used for. 


¢ use iteration to traverse a string. 
* concatenate and split strings. 


String: a sequence of characters. 
They can be letters, numbers, 
symbols, punctuation marks or 
spaces. 


Function: a function is a 
subprogram that performs a | 
specific task and can be used at 
any point in the program. High- 
level programming languages 

* have a number of useful built-in 

, functions. You can also create 

your own or use functions 


available in online libraries. 
a ee ee 


Index 0 


ee a 
String C O 


in p 


Table 2.3 Example string index 


Earlier in the chapter you learnt that a character is one of the four basic data 
types. A character can be a single letter, a symbol, a number or even a space. 


A sequence of characters is called a string. Although strings can contain 
different sorts of characters, including numbers, they are all treated as if 
they were text. 


When a computer executes a program it needs a way of telling the 
difference between a string and an instruction. In most programming 
languages this is achieved by enclosing strings in quotation marks (e.g. 
‘johnsmith@mail.com’, ‘10/04/15’ or ‘123’). Both single ‘’ and double “” 
quotation marks are acceptable, as long as they are used consistently. In 
the example “It’s her book”, the apostrophe is treated as part of the string 
because double quotes have been used to enclose the string. 


Strings are very useful when communicating with users. For example, asking 
them to enter some information into a program or displaying the output of 
a program in a format that humans can read and understand. 


String indexing 
Each character in a string has an index number, with the first character at 
position 0. You can use the index to reference individual characters in a string. 


For example, if the string myText holds the value ‘Computer Science’, then 
myText[1] references the letter ‘o’, myText[8] references the space and 
myText[11] the letter ‘i’. 


ae ee 6.1 -Taiees 10a; 71% 12 hs 14 15 
Li | e faa SN SN e n Cosi 
Length 


The Edexcel pseudo-code has a built-in LENGTH function, which you can 
use to find the number of characters in a string (e.g. in the case of the 
string ‘Computer Science’, SET numChars TO LENGTH(myText) assigns 
the value 16 to the variable numChars). 


The programming language you are studying will have a function for 


finding the length of a string. You should investigate what it is called and 
how it works. 
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| Design and write a program to check the length of a password. If the 

| Password entered is less than six characters, the program should output 
‘The password you have entered is not long enough’; otherwise it should 
output ‘Length of password OK’. 


String traversal 


You can use a FOR loop to cycle through each of the characters in a string. 
This is known as string traversal. 


The following algorithm prints out the word ‘monkey’ letter by letter, 
displaying each letter on a separate line. 


SET animalName TO ‘monkey’ 
FOR index = 0 TO LENGTH(animalName) - 1 


SEND animalName[index] TO DISPLAY 
END FOR | 


Concatenation 

As you learnt in Chapter 1 (on page 18), concatenation involves joining two 
or more items of information together. Concatenating two strings produces 
a new string object. Concatenation comes in very handy when displaying 
text on screen. 


The following piece of code asks the user to input their name and stores it 
in the variable userName. The string ‘Hello ‘ and the value stored in 
userName are concatenated using the & symbol, which is known as a 
concatenation or append operator. 


RECEIVE userName FROM (STRING) KEYBOARD 
SEND ‘Hello ‘ & userName TO DISPLAY 


Most programming languages have a number of different ways of 
concatenating strings. You need to investigate how it is done in the 
language you are studying. 


Type conversion 

Type conversion is performed when a string and a non-string (for example 
an integer) are joined together. The computer converts the non-string into a 
string before joining the two strings together. 


In the next example the string value stored in the variable learnerName 
and the integer value stored in the variable testScore are concatenated to 
form anew string, which is stored in the variable message. 


RECEIVE learnerName FROM (STRING) KEYBOARD 
RECEIVE testScore FROM (INTEGER) KEYBOARD 
SET message TO learnerName & testScore 
SEND message TO DISPLAY 


| String traversal: using a loop to 

| cycle through each character ina 

| string. 

Activity 12 ? 
Develop a program in the high- 
level language you are studying 
that asks the user to enter a 
sentence and outputs the number 
of vowels that it contains. 

You will need to initialise a 
variable for each vowel to keep 

a running total of how often it 
appears in the string. 


Investigate the string formatting 
methods available in the high- 
level language you are studying. 


Slicing 
Slicing is the process of extracting part of a string. This example algorithm 
extracts the string ‘foot’ from ‘football’. 


SET sport TO “football” 
SET newString TO ‘' 
FOR index FROM 0 TO 3 DO 
SET newString TO newString & sport[ index] | 
| END FOR 
| SEND newString TO DISPLAY | 


Investigate how to concatenate and slice a string in the high-level 
programming language you are studying. 


2 Acompany wants a program to generate usernames for new 
employees. Each username consists of the first four letters of the 
employee’s last name and the first letter of their first name joined 
together. If the employee's last name is less than four characters in | 
length a letter ‘X’ is used to fill in for each of the missing characters. 
Develop a program that asks the user to input their first and last 
names and outputs their username. 

3 Use string slicing to split this line of data into a set of seven strings. 
‘23456,West Kirby,04/11/15,23.0,11.5,30,D1’ 

The comma indicates where one string ends and the next begins and 
should be omitted when the string is sliced. 


String formatting 

Programming languages provide various methods of formatting strings to 
improve the way they are displayed on screen. They are particularly useful 
for formatting numbers, for example, by setting the number of decimal 
places to display. 


Strings are immutable; this means that they cannot be changed. It might 
appear that you can alter a string, but what really happens is that a new 
variable is created and replaces the old one, for example: 


SET newLearner TO ‘Robert Bennett’ 
SET newLearner TO ‘Sally Hadwen’ 


In the first line of this code extract the value ‘Robert Bennett’ is stored 
in a memory location to which the identifier newLearner points. In the 
second line, the value ‘Sally Hadwen’ is stored in a different location in 
memory, and the newLearner identifier is then updated to point to this 
new memory location. 
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e Astring is a sequence of characters. 


* Each character in a string has a unique index value denoting its position in the string. The first character in a 
string has the index value 0. 


¢ High-level programming languages have a built-in length function that finds the length of a string. 
¢ A loop is used to traverse a string, character by character. 

¢ Concatenation is the process of joining two or more strings together. 

¢ Slicing is the process of extracting part of a string. 


¢ String formatting is used to control the way text is displayed on screen. 


$1 Describe how individual characters in a string are referenced. 


$2 Design an algorithm that uses a loop to traverse a string. 
$3 Describe how a string and a non-string are concatenated. 


C1 Develop a program that asks the user to input a sentence and then splits it up wherever a space occurs. Each 
word should then be displayed on a separate line. 


How confident do you feel about your answers to these questions? If 
you're not sure you answered them well, have another go at the activities 
in this section. 
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2.4 Data structures 


| Learning outcomes he 


By the end of this section you should be able to: 

¢ describe the structure of one- and two-dimensional arrays and give examples of their use. 
* create and use one- and two-dimensional arrays in programs. 

¢ describe the record data structure and explain what it is used for. 


¢ design record structures. 


| | 


Data structure: an organised 
collection of related elements. 
Arrays and records are two 


common data structures used in 


programming. 


(icp ae: | 


Here’s how to define an array in 


Python and Java. 


Python 
Scores = [15, 


Java 


Deel] Scores = 


4, 50}; 


29, @, 50) 


foley eke 


A data structure is an organised collection of related elements. There 
are a lot of different data structures used in programming. You've already 
encountered one of them - strings. In this section we will investigate two 
more: arrays and records. 


Arrays 

As you learnt in Chapter 1, an array is an organised collection of related values 
with a single shared identifier. All the elements in an array are the same data 
type. Each has a unique index value denoting its position in the array. 


In the Edexcel pseudo-code, an array is initialised using the SET command. 
For example, this statement initialises an array called firstNames with four 
elements, all of type string. 


SET firstNames TO [‘Alex’, ‘Bryn’, ‘Eloise’, ‘Lois’ ] 


The square brackets denote the start and end of the array. The next 
statement initialises an empty array called winners. 


| SET winners TO [{] 


As with strings, the LENGTH function returns the number of elements in an 
array. In the example below numItems is assigned the value 4. 


| numItems = LENGTH(firstNames ) 


Looping through an array 
FOR loops come in handy when working with arrays. The following algorithm 


uses a FOR...EACH loop to iterate through the array firstNames, element by 
element, outputting each name in turn. 


SET firstNames TO [‘Alex’, ‘Bryn’, ‘Eloise’, ‘Lois’ ] 
FOR EACH name FROM firstNames DO 

SEND name TO DISPLAY 
| END FOR | 


Different languages have different ways of implementing arrays. For 
example, in Python the list data structure is used to implement an array. In 
Java arrays have to be declared prior to being used. 
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Searching and sorting 


In Chapter 1 you explored algorithms for searching and sorting elements of 
an array. 


The following code uses a linear search algorithm. Describe how it works. 


SET firstNames TO [‘Alex’, ‘Bryn’, ‘Eloise’, ‘Lois’, 
‘games’, ‘Sally’ ] 
RECEIVE searchName FROM (STRING) KEYBOARD 


SET found TO False 
SET index TO 0 


WHILE found = False AND index <= (LENGTH(firstNames) — 
1) DO 
IF searchName = firstNames[index] THEN 
SET found TO True 
END IF 
SET index to index + 1 
END WHILE 


IF found = True THEN 
SEND searchName & ‘ is at position ‘ & index & ‘ in 


the list’ TO DISPLAY 
BESE 

SEND searchName & ’‘ is not in the list’ TO DISPLAY 
BND DE 


Implement this algorithm in the high-level programming language you 
are studying. 


; 


Activity 16 ? 


In Chapter 1, Activity 22 you developed an algorithm that output the 
highest and lowest results. 

Implement this algorithm in the high-level programming language you 
are studying. 


High-level programming languages have a number of methods for working 
with arrays, including append (to add an item to an array), pop (to remove 

an item from an array) and reverse (to reverse all the elements in an array). 
They might also have a built-in sort method. 


Investigate the array handling methods available in the high-level 
programming language you are studying. _ 
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| When using a WHILE loop with 
an array, the loop terminating 
' condition is LENGTH - 1 rather 
_ than LENGTH. This is because 
the index of an array starts at 
0, so the index value of the last 
_ element of the array is one less 
than the number of elements in 
| the array. 


Arrays can be static or dynamic. 
A static array has a fixed length 
that must be declared when it is 
created (e.g. myFriends[5]).A 
dynamic array can grow as new 
items are added. It would be 
declared as myFriends[ ]. 


Two-dimensional array: a matrix 
, of rows and columns resembling 
a table. Two indices are used, 
| one to reference the rows and 
the other the columns. All the 


elements of a two-dimensional 
array share the same data type. 


Two-dimensional arrays 

So far, we have only considered one-dimensional arrays consisting of a 
single row of elements. But a lot of data comes naturally in the form of 

a matrix. For example, train and bus timetables, conversion tables and 
football league tables. They can be represented as two-dimensional arrays. 
In actual fact, a two-dimensional array is an array of arrays. Each row of a 
two-dimensional array is a one-dimensional array in its own right. 


Here is an extract from an array named examResults. It has three rows, 
each of which stores a set of four exam results. The mark of 47 is located at 
examResults[1, 2] -second row, third element along — and the value 80 
atexamResults [0, 0] - first row, first column. 


Initiating a two-dimensional array 

In the Edexcel pseudo-code, the SET command is used to initiate a two- 
dimensional array. Note the two sets of square brackets showing that 
examResults is an array of three elements, each of which is an array. 


SET examResults TO [[80, 59, 34, 89], [31, ll, 47, 64], 
(29, 56 eon 


_ Here is how to initiate a two-dimensional array in Python and Java. 
Python 
Scores = [[80, 59, 34, 89], [31, 11, 47, 64], (29, 56, 
Toot 

| Java 

tint ][] scores = {160,597 34, 89}; (31, eo 

1 Y2o 56,513, 91s) 


Looping through a two-dimensional array 

A pair of nested FOR loops is used to traverse a two-dimensional array: one 
to iterate through the rows and the other to iterate through the columns. 
The FOR loop that iterates through the columns is nested within the one 
that iterates through the rows. For each iteration of the outer loop, the inner 
loop iterates through all the columns in one row. This is demonstrated in the 
algorithm below. 


FOR EACH row FROM examResults DO 
FOR EACH column FROM examResults[row] DO 


SEND examResults[row, column] TO DISPLAY | 
| END FOR | 
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Activity 18 


1 Investigate how to initialise a two-dimensional array in the high-level All the data values stored in an 
programming language you are studying. array must be of the same data 
2 Write a program that creates and initialises an array to hold these five type. In this case they are all 
sets of marks. strings. You will have to convert 
80, 59, 34, 89 the scores to integers to find the 
31, 11, 47, 64 _ highest. 
29, 56, 13,91 
35).61, 48, 0 
Peaierol, 91 
3 Extend your program so that it calculates and displays the highest 
mark, the lowest mark and the average mark achieved. 


4 This two-dimensional array holds the highest score achieved by each 
' player in each of the three levels of an online game. 


Player Level Score 
Alex | 19 
Seema il 29 
Seema 2 44 
Lois 1 10 
Alex 2 1% 
Alex a 36 
Dion 1 25 
Emma wisp eZ 
Emma zZ 48 


Table 2.4 Example two-dimensional array 


Develop a program that initialises the array and then searches through it | 
to find the player with the highest score in each of the three levels. | 


Records 
We have already said that the elements of an array must all be the same ‘Keyterms 


data type. In contrast, the record data structure stores a set of related Record: a data structure that 


values of different data types. stores a set of related values of 


Each element ina record is known as a field and is referenced using a field name. eT Duce N eee: 


Field: an individual element ina 


Table 2.5 illustrates how the record data structure works. Each row of the table regard 


holds a set of information about a particular learner. Each column stores one 
item of information about the learner - their learner number, their age, their 
form etc. All the values in a column have the same data type - learnerNum 
and age are integers; firstName, lastName and formare strings. 


learnerNum _ firstName lastName age form 
1 Susan Smith i 1 10H 
2 Ben _ Roberts 14 ~~ :10B 
3 2 ei!) » 10A 
4 lan Wright 15 _ 10G 


Table 2.5 Example record data structure 
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Programming ae 


Programming languages vary in the way they handle the record data structure. 


Activity 19 

1 Arecord data structure is to be used to store the details of music 
albums. Give the appropriate data type for these fields: 
athe title of the album; 
b the name of the artist; 


c_ the year of release; 
d_ the genre. 


2 Develop a program that uses a record structure for storing the details 
of music albums. It must: 


It is essential to have a way to sort 
through the vast amounts of recorded 
music available b allow the user to input the details of new albums; 


a_ have fields for title, artist, year of release and genre; 


c allow the user to search for an album by name and display its 
details. 


The XtraSave chain of supermarkets collects data about product sales. 
The record for each product includes: 

e its unique ID (e.g. X14 or X98); 

e the total number sold in each month of the year. 

1 Identify a suitable data structure to store this data. (1 mark) 


2 Explain why the data type you chose is most suitable to hold this 
data. (3 marks) 

3 Drawa diagram of the data structure that shows the amount sold in the 
first three months of the year. Include data for at least two products. 
(3 marks) 


This question is testing your ability to select and design data 
structures. Make sure you have: 


* identified a suitable data structure to store the data and justified your 
choice; 


* provided the sample data requested (i.e. sales figures for two products 
for the first three months of the year). 


Row and column headings can be included as long as they are clearly 
distinguishable from the data. 


¢ A data structure is an organised collection of related elements. Arrays and records are common data structures. 


* Aone-dimensional array is a list of elements, each of which has a unique index value representing its position 
in the list. 


* Atwo-dimensional array is a matrix of rows and columns. Each element in the array has a unique pair of indices, 
one to identify the row and one the column in which it is located. 


¢ All the elements in an array have the same data type. 
* Arecord consists of a collection of fields. The values stored in a record can be of different data types. 


$1 What is the index of the first element in a one-dimensional array? 


S2 How does a linear search algorithm find an element in a one-dimensional array? 
$3 How is an element stored in a two-dimensional array referenced? 
S4 How is a nested IF used to traverse a two-dimensional array? 


C1 Develop a program for a simple address book that uses a one-dimensional array to store a set of names and 
email addresses, and allows the user to search for a person by name and returns their email address. 


C2 Develop a program that uses a two-dimensional array to represent a treasure map consisting of a grid of 4 
rows and 4 columns. 
A random number function should be used to establish the location of the treasure. 
The user must hunt for the treasure by repeatedly entering the coordinates of squares. The program should 
tell them when they have found the treasure and help them in their search by indicating how close they are. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try redoing the activities on pages 63, 65 
and 66. 
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2.5 Input/output 


Learning outcomes bh 


By the end of this section you should be able to: 
¢ explain the need for validation. 


* write code that validates user input. 


e write code that accepts and responds appropriately to user input. 


* write code that reads to and writes from a text file. 


Validation: to check that the data 
entered by a user or froma file 
meets specified requirements. 


User input 


Most programs require some form of input either from a user or from a file. 
You already know how to receive user input from a keyboard. 


A program can be made much more ‘user friendly’ by displaying helpful 
messages informing users of what they are expected to enter and 
confirming successful input. 


Validation 

It is important to ensure that data entered by the user is valid, as invalid 
data can cause a program to behave unexpectedly or even stop altogether. 
Not surprisingly, ifthe data entered into a program is incorrect, the output 
it produces will also be wrong. This is sometimes called the Garbage In, 
Garbage Out (GIGO) principle. 


Any program that requires data entry should have appropriate forms of 
validation built in. But be warned, validation can’t guarantee that the data 
entered is correct. [t can only make sure that it is reasonable. 


There are a number of different types of validation. 


Range check 

A range check is used to ensure that the data entered is within a specified 
range. Study this algorithm, which checks that the number entered is 
between 1 and 10. 


BOOLEAN valid 
' SET validNum TO False 
WHILE validNum = False DO 


SEND ‘Please enter a number between 1 and 10:’ TO 
DISPLAY 


RECEIVE number FROM (INTEGER) KEYBOARD 
IF number >= 1 AND number <= 10 THEN 
SET validNum TO True 
END IF . 
‘END WHILE ~ 
SEND ‘You have entered: ‘& number TO DISPLAY | 
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The algorithm uses a Boolean variable named validNum as a status flag. It 

is initially set to False. The WHILE loop continues running until val idNum is 
equal to True. An IF statement determines if the value of ValidNum should 

be set to True. 


Validating user input 

Many programs require the user to choose from a list of options, for 
example ‘yes’ or ‘no’, or ‘a’, ‘b’ or ‘c’. In this situation it is important to check 
that a valid option has been chosen. 


BOOLEAN validChoice 
SET validChoice TO False 
WHILE validChoice = False DO 
SEND “Please enter either ‘Yes’ or ‘No’:” TO DISPLAY 
RECEIVE userChoice FROM (STRING) KEYBOARD 
IF userChoice='Yes’ OR userChoice='No’ THEN 
SET validChoice TO True | 
END IF | 
END WHILE | 
SEND You have selected: ‘& userChoice’ TO DISPLAY 


This algorithm requires the user to enter either ‘Yes’ or ‘No’ and uses an 
IF statement to validate their input. The WHILE loop is repeated until an 
appropriate choice is made. 


A problem with this algorithm is that the user must enter their choice 
exactly, in order for it to be accepted. For example, entering ‘yes’ instead of 
‘Yes’ would not be accepted. This issue can be addressed by altering the IF 
statement to accept more valid options. 


IF userChoice = ‘Yes’ OR userChoice = ‘yes’ OR 
userChoice = ‘No’ OR userChoice = ‘no’ THEN ... 


————E Ss 
4 
Many programming languages have functions that can be used to convert 


a string to upper or lower case. This can simplify the validation of strings, 
for example: 


Python 
if userChoice.lower() == ‘yes’ or userChoice.lower() == 


OsoKo)” 4 
validChoice = True 


Java 
if(userChoice.toLowerCase() == ‘yes’ || 
userChoice.toLowerCase() == ‘no’') { 


validChoice = true 
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Implement the range check 
algorithm in the high-level 
programming language you are 
studying. 


Design and implement a program 
for a user menu with these four 
options: 


1 display average temperature 
2 display temperature range 

3 display wind speed 

4 quit the program 

The program should ask the user 
to select an option and validate 
their entry. Entries in upper and 
in lower case should be accepted. 
When a valid option is selected 

it should be displayed on screen 
with an appropriate message. 


Activity 22 


Implement the length check 
algorithm in the high-level 
programming language you are 
studying. 


Implement the presence check 
algorithm in the high-level 
programming language you are 
studying. 


Length check 

It is sometimes necessary to check that the length of a value entered falls 
within a specified range. For example, all UK postcodes are between six and 
eight characters long, so validation could be used to check that the length 
of a postcode entered is within this range. Needless to say, even if it is, that 
doesn’t mean to say it is correct. 


RECEIVE postCode FROM (STRING) KEYBOARD 
SET length TO LENGTH(postCode) 
IF length >= 6 AND length <= 8 THEN 
SEND ‘Valid’ TO DISPLAY 
ELSE 
SEND ‘Invalid’ TO DISPLAY 
END IF 


Presence check 
Another type of validation is a presence check. This simply ensures that a 
value has been entered, preventing the user from leaving an input blank. 


This algorithm asks the user to input their name and uses a presence check 
to ensure they have entered a value. Any value will cause the loop to finish. 
It will keep asking the user to input their name until they input a value. 


' SET userName TO ‘' 
WHILE userName = ‘’ DO 
RECEIVE userName FROM (STRING) KEYBOARD 
END WHILE 
SEND ‘Hello ’‘ & userName TO DISPLAY 


Look-up check 


A look-up check is used to test that a value is one of a predefined set of 
acceptable values. The list of acceptable values can be stored in a one- 
dimensional array. 


This algorithm stores a list of valid form names in an array. It compares the 
form name entered by the user with the values in the array. 


SET Mrteyrorms 10) |‘ 7AXB’, ‘7PDB’, ‘“@eRL’, *7TJEH’ } 
RECEIVE form FROM (STRING) KEYBOARD 


SET valid TO False 
| SET index TO 0 
SET length TO LENGTH(arrayForms ) 


WHILE valid = False AND index < length DO 
IF form = arrayForms[index] THEN 
sol Valve, TOM Truc 
END IF 
SET index TO index + 1 
END WHILE 
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IF valid = True THEN 
SEND ‘Valid form’ TO DISPLAY 
ELSE 


SEND ‘The form you entered does not exist.’ TO DISPLAY 
END IF 


Menus 


Many programs feature menus that allow the user to choose which part of 
the program they wish to use. The simplest way to implement a menu is to 
give the user a list of numbered options to choose from and ask them to 
input the number that corresponds to the option they want to select. The 
user input can be validated using a range check to ensure that the option 
selected is permissible. A menu could be set out like the example below. 


Option 1: Find the highest value 
Option 2: Find the lowest value 

Option 3: Calculate the average 

Please select either 1, 2 or 3: 


This algorithm implements the menu shown above. 


SEND ‘Option 1: Find the highest value’ TO DISPLAY 
SEND ‘Option 2: Find the lowest value’ TO DISPLAY 
SEND ‘Option 3: Calculate the average’ TO DISPLAY 
validChoice = False 
WHILE validChoice = False DO 
SEND ‘Please select either 1, 2 or 3:’ TO DISPLAY 
RECEIVE choice FROM (INTEGER) KEYBOARD 
IF choice = 1 OR choice = 2 OR choice = 3 THEN 
SET validChoice TO True 
BND, PF 
END WHILE 


Testing validation rules 

It is important to test your validation rules to ensure they work as expected. 
You should use: 

Normal data - This is data that is within the limits of what should be 
accepted by the program. For example, a password with seven characters 
fulfils the validation rule that states that passwords must be between six 
and eight characters in length. 

Boundary data - This is data that is at the outer limits of what should be 
accepted by the program. For example, if a validation rule specifies that the 
range of acceptable values is >= 75 AND <= 100, then a value of 100 is at the 
upper limit and a value of 75 at the lower. 

Erroneous data — This is data that should not be accepted by the program. 
For example, a value of 0 should not be accepted by either of the validation 
rules given above. 
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Activity 24 
Implement the look-up check | 
algorithm in the high-level 
programming language you are 
studying. 


| Activity 25 


Add a menu to the program 

| you developed in Question 3 

of Activity 18. It must allow the 

user to select whether they want 

to see the highest, lowest or 
average mark achieved. 


Text file: a sequence of lines, 


each of which consists of a 
sequence of characters. 


Stage 1 


Stage 2 


Working with text files 

The programs you have created so far haven't required a huge amount of 
data entry, but imagine typing a set of test results for everyone in your 
computer science class. It would take a considerable amount of time to do 
and - even worse — when the program terminates all of the data will be 

lost. Should you need to use it again you'd have to re-enter it. This is where 
storing data in an external file comes in really useful. If the data you enter is 
stored in an external text file you can access it as often as you like without 
having to do any further keying in. 


Commas are used to separate individual values on a line and a special 
character is used to denote the end of a line. 


Text files provide permanent storage for data. This means that the data can 
be reused without having to be retyped. Data can be read from, written to 
and appended to a file. 


Here is an example of data stored in a text file that contains four learners’ 
examination results. Earlier in this chapter you saw how a two-dimensional 
array could be used to store this data. 


As you can see, each line of the file represents one learner's record, with 
commas used to separate values. 


| Lois Bennett,56,45,78,32 
Eloise Roberts,23,89,67,98 

| James Hadwen,65,43,83,45 

; Patrick Dua-Brown,21,43,38,29 


Reading data from a text file 


Data is read from a text file line by line, with each line being stored by the 
program as a string. Each string is then split into an array of strings using 
the comma as a separator. 


This algorithm reads the data from the external text file marks.txt and 
calculates the average score for the first test. It uses three predefined 
functions: APPEND, SPLIT and LENGTH. 


SET rawData TO [] 

FOR EACH record FROM marks.txt DO 
READ marks.txt record 
APPEND(record, rawData) 

END FOR 


SET examResults TO [] 
FOR EACH item in rawData DO 
SET nextLearner TO SPLIT(item, ‘,’) 
| APPEND(nextLearner, examResults) 
END FOR 


T2 


SET testTotal TO 0 | 
| FOR EACH learner FROM examResults DO | 


SET testTotal TO testTotal + INT(examResults[learner, 1}) 
END FOR 


SET testAverage TO testTotal / LENGTH(examResults) 


SEND ‘The average mark achieved in the first examination is: 
* & testAverage TO DISPLAY 


The algorithm carries out the task in three stages. 


Stage 1 

Each record in the file is read in and appended to the one-dimensional array 
rawData, using a predefined function APPEND. At the end of this stage the 
rawData array has four elements, each of which is a string. 


Stage 3 


Elements Lois Bennett. 56.45.7832 ‘Eloise Roberts,23,89,67,98' 


Stage 2 

A loop is used to traverse the array rawData, element by element. A 
predefined function SPLIT is used to split each element in turn into a set 
of strings, using the commas as separators. Each set of strings is assigned 
to the temporary array nextLearner and is then appended as a new row 
to the two-dimensional array examResults. At the end of this stage the 
examResults array looks like this. 


0 ‘Lois Bennett’ _ on po ae pe: oe bey 4 

1 ‘Eloise Roberts’ 3 | 89" jin Ela 
2 ‘James Hadwen’ | 6a5 ‘43’ Ci oa 
3. ‘Patrick Dua-Brown’ 21) 43) Se Peo ae 
Stage 3 


Finally the algorithm iterates through each row of the array examResults, 
converting the first set of marks for each learner to integers and totalling 
them. It then calculates the average mark achieved in the first examination 
and displays it on the screen. 


| 1 Implement the algorithm shown above in the high-level programming 

language you are studying. (You will need to create a text file in order 
to test the program. It should consist of at least ten records.) 

2 Extend your program to allow the user to choose which of the four 
examinations they want the average mark for. The program should 
accept valid user input only. 
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‘James Hadwen,65,43,83,45' ‘Patrick Dua-Brown,21,43,38,29’ 


Here is an example of how to 
read data from a text file in 
Python and Java. 

Python 

file = open(’scores.txt’, 
ir’) 

Java 


FileReader reader = new 
FileReader(‘scores.txt’); 


A text file can be used to store the 
students’ test scores 


Writing data to a text file 

The data stored in variables and arrays only remains in memory while the 
program is running, so if you want to retain the data after the program is 
exited you will need to write it to a file. 


SEND ‘How many learners do you want to enter results 
| for?’ TO DISPLAY 


RECEIVE numLearners FROM (INTEGER) KEYBOARD 


| FOR index FROM 1 TO numLearners DO 

RECEIVE name FROM (STRING) KEYBOARD 

RECEIVE markl FROM (INTEGER) KEYBOARD 

RECEIVE mark2 FROM (INTEGER) KEYBOARD 

RECEIVE mark3 FROM (INTEGER) KEYBOARD 

RECEIVE mark4 FROM (INTEGER) KEYBOARD | 
| WRITE marks.txt name, markl, mark2, mark3, mark4 
END FOR 


The algorithm shown above writes data to an external text file called 
marks.txt. It asks the user how many learners they want to enter marks 
for. A FOR loop is then used to enter a name and a set of four marks for each 
learner and to write this data as a new record in the file marks. txt. 


| Here is an example of how to write data to a text file in Python and Java. 


Python 
file = open(‘scores.txt’, ‘w’) 
file.write(name, markl, mark2, mark3, mark4) 
| file.close() 
Java | 
FileWriter writer = new FileWriter(‘scores.txt’, true); | 
_writer.write(name, markl, mark2, mark3, mark4); | 
writer.close(); | 


Data imported from a file needs to be validated in the same way as data 
entered by the user is validated. Records that do not meet the validation 
rules should be rejected and stored in another file (known as an error log) so 
the user knows which records haven't been imported. 
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Amend the program you developed in Activity 26 to include a validation 
rule that ensures that only valid marks between 1 and 100 are read in 
from the file. Any invalid records should be rejected and written to an 
error log. 


Make sure your text file includes some erroneous data so that you can 
test that the validation rule is working. 


The XtraSave chain of supermarkets collects data about the products it 
sells. It records the department, name and price for each product. The 
data is stored in a text file with one record per line. The file contains over 
8,000 records. Part of the file is shown below. 


Tinned, Baked Beans, 0.59 

Bakery, Wholemeal Bread, 1.50 
Drinks, 1L Apple Juice, 0.90 

Dairy, 1 Pt Semi-Skimmed Milk, 1.25 
Drinks, 1L Orange Juice, 0.90 


XtraSave wants to produce a list of all products in the drinks department 
of the store. 


Describe an improvement that could be made to the data from the file 
when used for this purpose. (3 marks) 


Supermarkets collect a great deal of information about the products they sell 


ii 


? 


Activity 28 ¢ 
When saving records to a file you 
can choose to write or append. 
Investigate the difference 
between the two and find out 
how they are implemented in 

the high-level programming 
language you are studying. 


* Validation techniques should be used to ensure that data entered by a user or from a file is valid. They can’t 
guarantee that the data is correct, only that it is reasonable. 


A range check is used to ensure that data is within a specified range. 

¢ A length check is used to ensure that data has a length within a specified range. 

¢ A presence check is used to ensure the user has entered some data. 

¢ A look-up check is used to ensure the data matches one of the values in a predefined list. 


¢ When users are required to choose from a list of options, their input should be validated to ensure that their 
choice is valid. 


* Large sets of data are normally stored in text files. The advantage of writing data to a file is that the data is not 
lost when the program is terminated. It can be read in from the file whenever it is needed. 


‘Strengthen 


S1 Describe what might happen if a program doesn’t include validation on user input. 
S2 Identify a data structure suitable for storing a list of values used in a look-up check. 
$3 Show how your name, date of birth and favourite colour would be stored in a text file. 
S4 Explain why it is advantageous to write data to a text file. 


Challenge = = = eens | 


C1 Develop a program that: 
* writes a set of employee records consisting of employee number, name and department to a text file: 
¢ reads in the stored records from the text file; 
¢ allows the user to search for an employee’s details; 
* uses appropriate validation. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try redoing the activities in this section. 


2.6 Subprograms 


: ea>mning outcomes 


By the end of this section you should be able to: 

e describe what a subprogram is. 

* explain the benefits of using subprograms. 

* explain the difference between global and local variables. 

* explain the concept of passing data into and out of subprograms. 
* create subprograms that use parameters. 

* write code that uses user-defined and pre-existing subprograms. 


A subprogram is a self-contained module of code that performs a specific 
task. For example, a high-level programming language could provide a 
predefined subprogram that calculates the average of a set of numbers. A 
programmer can use this subprogram in any program they write without 
needing to know how it works. In other words, subprograms support the 
process of abstraction. 


Using subprograms reduces the complexity of programs and makes them 
easier to understand. It also saves time because the code only has to be 
written once but can be used many times, and makes programs easier to 
maintain. Furthermore, different subprograms can be allocated to different 
programmers, allowing large teams of programmers to work together on the 
same project. 


There are two types of subprograms: functions and procedures. 


Here is a function that simulates the throw of a die. 


FUNCTION die() # This is the start of the function. 


# This statement uses the predefined 
function RANDOM to generate a random 
number between 1 and 6 which is stored in 
the variable simDie. 


simDie = RANDOM(6) 


# This returns the value of the variable 
simDie to the main program. 


RETURN simDie 


END FUNCTION # This ends the definition of the function. 


| This function is called by the main program like this. 
| dieThrow = die() 


| The value returned by the function is stored in the variable dieThrow. 


Local and global variables 

Notice that there are two variables that store the random number generated 
by the function. In the function itself the variable simDie is used. This 
variable only exists within the function and is referred to as a local variable. 
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| Abstraction: the process of 
_ removing or hiding unnecessary 
| detail so that only the important 


points remain. 


Procedure: a subprogram 
containing a set of statements that 


are executed when the procedure 
is called. Unlike a function, a 
procedure does not return a value 
to the main program. 


Local variable: a variable that 
is accessed only from within the 
subprogram in which it is created. 


In a flowchart a subprogram is 
represented by this symbol. 


Die 


Figure 2.3 The symbol for a 
subprogram in a flowchart 


' Global variable: a variable that 


‘ 


can be accessed from anywhere 

in the program, including inside 
subprograms. 

| Parameters: values that are | 
passed into a subprogram when 

it is called. | 


Scope: the region of code within 
which a variable is visible. 


[ 
| 
| i 


Figure 2.2 Calculating the area of a cube 


In the main program the value returned by the function is stored in the 
variable dieThrow. It can be used anywhere within the main program and is 
therefore referred to as a global variable. 


Unlike a function, a procedure does not return a value to the main program. 


| PROCEDURE averageScore(scorel, score2, score3) 
| BEGIN PROCEDURE 
| SET total TO scorel + score2 + score3 
SET average TO total / 3 
SEND average TO DISPLAY 
END PROCEDURE 


The name of this procedure is averageScore and its parameters are 
scorel, score2 and score3. These are values that the procedure needs | 
to do its job. Data is passed into the subprogram by placing it in brackets | 
after the subprogram name, e.g. | 
averageScore(20, 24, 19) | 
This statement calls the procedure and passes the values 20, 24 and 19 

into it. In this instance, the value stored in the variable average once the 
procedure has finished executing is 21. 


Both functions and procedures can have parameters. An empty pair of 
brackets indicates that a subprogram does not have any parameters. 


1 Investigate how subprograms are implemented in the high-level 
programming language you are studying. 


2 Write the die function and the averageScore procedure in the high- 
level programming language you are studying. | 


Local and global variables have different scope. A local variable is only 
available within a subprogram, whereas a global variable can be accessed 
anywhere within the program. 


| REAL cubeSurfaceArea 
| PROCEDURE cubeSurfaceCalc(width, height) 
BEGIN PROCEDURE 
| SET cubeArea TO width * height 
SET cubeSurfaceArea TO cubeArea * 6 
END PROCEDURE 


In this example cubeArea is a local variable because it is declared within 
the procedure cubeSurfaceCalc. It therefore cannot be accessed by the 
main program. On the other hand, cubeSurfaceArea is a global variable 
because it is declared in the main program. It can be accessed throughout 
the program, including from within the subprogram. 
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1 Implement the cubeSurfaceArea procedure in the high-level If a local variable and a global 
programming language you are studying and test it with the call variable share the same name 
cubeSurfaceArea(10, 5). | the subprogram will use the local 

2 Write a function that will input an array of numbers and return the variable and the global variable 
highest and lowest numbers. | will not be changed. | 


CONST REAL discount 
SET discount TO 0.1 


FUNCTION discountPrice(total) 
BEGIN FUNCTION 
IF tetal >= 50 ZHEN 
SET saving TO total * discount 
SET newTotal TO total - saving 
RETURN newTotal 
ELSE 
RETURN total 
END IF 
END FUNCTION 


RECEIVE subTotal FROM (REAL) KEYBOARD 
SET salePrice TO discountPrice(subTotal) 


| SEND salePrice TO DISPLAY 


_ This algorithm is used to calculate the sale price of a customer's shopping. 
Identify the local variables, global variables and constants it uses. 


Along with variable and type declarations, command sequences, selection, 
iteration and data structures, subprograms are one of the essential structural 
components of a program. Needless to say, just as with variables and data 
structures, it is important to give subprograms meaningful names. This will 
make what they do more obvious and will make your programs easier to read. 


Extend your knowledge 


When a subprogram calls itself this is known as recursion (see Chapter 1, 
page 33). 


A recursive subprogram always has at least two pathways — one that leads 
to the subprogram being called again and one that leads to a terminating 
condition being met. In Chapter 1 you learnt how recursion is used ina 
merge sort algorithm to repeatedly divide a list in two. The terminating 
condition is reached when the size of each list is one. 
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This question is testing your ability 
to identify the seven key structural 
components of a program. Make 
sure you have identified the line 
numbers in which the components 
can be found. 


Built-in functions: functions 

that are provided in most high- 

level programming languages to 
_ perform common tasks. 


SET valuesArray TO [3, 9, 12, 16, 4, 93] 
REAL max 


FUNCTION maxCalc(values) 
BEGIN FUNCTION 
SET length TO LENGTH(values) 
SET max TO 0 
FOR index = 0 TO length - 1 DO 
IF values[index] > max THEN 
SET max TO values[index] 
END IF 
END FOR 
RETURN max 
END FUNCTION 


SET max TO maxCalc(valuesArray) 
SEND max TO DISPLAY 


| 1. Identify line number(s) that show one example of each of these 
structural components of a program: (6 marks) 


Donan oe 


2 Identify the line number where a subprogram call is made. (1 mark) 


variable initialisation 
type declaration 
selection 

iteration 

data structure 
subprogram 


Built-in functions 


In addition to user-written subprograms, most high-level programming 
languages have a set of built-in functions for common tasks. These are 
designed to save the programmer time, such as functions that round 
numbers, count the number of characters in a string and generate random 


numbers. You have come across some of these already. 


The Python programming language has over 65 built-in functions. 


Investigate the built-in functions provided by the high-level programming 
language you are studying. 


If the 


always use one from an external library. These provide more specialised 
functions. For example, libraries for game development often include 


re is not a built-in function that meets your needs you could 


functions for collision detection. 
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Th 


e self-checkouts used by the XtraSave chain of supermarkets allow 


the user to scan the barcode on each product. Once the user has 
scanned an item, its details are retrieved and passed to the subprogram 
processItem. The parameters of the subprogram are: itemName, 


ae 
Th 


emPrice and itemWeight. 
e subprogram must: 
display the name and price of the item; 


every five seconds ask the user to place the item in the bagging area 
until they have done it; 


check whether the item has been placed in the bagging area by 
comparing the itemWeight to the totalWeight (weight of all the 
items in the bagging area). 


A global variable, totalWeight, holds the overall weight of the items in 
the bagging area and is updated each time a new item is added. 


Th 


e delay() subprogram has already been created. It allows program 


execution to be paused for five seconds. 


1 
2 


Design the processItem subprogram using pseudo-code. (9 marks) 
State the name of a parameter that is used in this algorithm shown 
below. (1 mark) 

State the value that will be displayed on screen after line 12 is 
executed. (1 mark) 


State the value of the variable area after line 6 is executed. (1 mark) 
State the value of the variable area after line 11 is executed. (1 mark) 


REAL area 


FUNCTION areaCalc(width, height) 
BEGIN FUNCTION 
REAL area 
SET area TO width * height 
RETURN area 
END FUNCTION 


SET area TO 5 
SET rectangleArea TO areaCalc(5,10) 
SEND rectangleArea TO DISPLAY 


Use a conditional loop to ask 
the user to place the item 

in the bagging area. The 
condition should check whether 
the difference between the 
totalWeight value at the 

start and current value of 
totalWeight is equal to the 
totalWeight. 


There are two variables called 
area in this program, one 

is a global variable and the 

other is a local variable. You 

need to demonstrate that you 
understand how the values of the 
two variables change during the 
execution of the program. 


¢ Asubprogram is a section of code within a larger piece of code that performs a specific task. It can be used at 
any point in the program. 


¢ A function is a Subprogram that returns a value to the main program. 

¢ A procedure is a subprogram that does not return a value to the main program. 

¢ Parameters are values that are passed to a subprogram when it is called. 

¢ Local variables can only be accessed from within the subprogram in which they are created. 

¢ Global variables can be accessed anywhere in the program, including inside subprograms. 

¢ Built-in functions are functions provided in a high-level programming language to perform common tasks. 


$1 Explain the benefits of using subprograms. 


$2 Explain with examples what is meant by the scope of a variable. 

S3 Describe what happens when a global variable and a local variable share the same name. 

S4 Give an example of a common built-in function. 

Challenge ———— ee 

C1 Design and implement a calculator program that: 
* allows the user to enter a set of numbers; 
* uses separate functions to calculate the mean, mode and median; 
* allows the user to select which function they want; 
* uses appropriate validation. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try redoing the activities in this section. 
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2.7 Testing and evaluation 


Learning outcomes bh 


By the end of this section you should be able to: 

* describe different types of error in programs. 

¢ design and use test plans and test data. 

* determine what value a variable will hold at a given point in a program. 

* evaluate the strengths and weaknesses of a program and suggest improvements. 


ae 


From th t fe i Oj inki 
the outset of a programming project you should be thinking about Tea erseGlinEien Rates 
what you need to achieve and how you will test that the program functions (aunchedttrom Case Canameral in 
as expected. Testing should be an integral part of each stage of the Decembeniies i Rperor niate 
program, starting at the design stage, during implementation and once than USmeecotrmiihon: ericson 
you think the program is finished. This is the only way of ensuring that the , 
ae was to provide valuable climate 

program functions correctly and meets all of the specified requirements. information about Mare: C@akieiieer 


It is very rare for a programmer to develop a program that executes exactly due to an error in the ground- 
as intended first time. There are nearly always errors that need to be found based computer software, Orbiter 
and fixed. You've already come across two types of errors: logic and syntax. entered the planet's atmosphere 
at the wrong entry point and 
Logic errors disintegrated shortly thereafter 
As you learnt in Chapter 1 (on page 21), logic errors occur when the thinking without having sent back any 
behind an algorithm is incorrect so that the output isn’t what is expected or information. Had the software 
intended. Ideaily. logic errors should be identified and fixed at the design stage. been properly tested we might 


now be much better informed 
about the possibility of there 
being life on Mars. 


The following algorithm is intended to work out whether a learner has passed 
a test. Learners need a score of 80 or above to pass. However, a logic error in 
the algorithm means that it produces an incorrect and unexpected result. 


IF testScore <= 80 THEN 
SEND ‘Pass’ TO DISPLAY 
ELSE 
SEND ‘Failed’ TO DISPLAY 
END IF 


Trace tables 
You saw in Chapter 1 (on page 22) how a trace table can be used to identify 
logic errors at the design stage. 


SET table TO 5 
FOR index FROM 1 TO 5 DO | 
SET answer TO index * table | 
| SEND answer TO DISPLAY 
END FOR 


How many female students are there in 
this class? 


This algorithm displays part of the five-times table. Here is its trace table. It 
records each variable change that occurs during execution of the algorithm. 
Each time a line of code alters the value of a variable or variables a new row 
of the trace table should be completed. 


] 

5 
Z 

10 
g 

is 
4 

20 
2) 

() 


SEY gender TO -["M", °M’, 9P yan eee ee ee | 
Me eRe 
SET length TO LENGTH( gender) 

SET count TO 0 

SET index TO 0 


REPEAT 
IF gender[index] = ‘F’ THEN | 
Saw CoOwre WO) Gewiwe sr i 
END IF 
SET index TO index + 1 | 
| 


UNTIL index = length 


This algorithm is intended to count the number of female learners in a class. 
Create and complete a trace table for this algorithm. 


L& Python 3.4.1: Santen - C:/Python/example3.py _ Oo x 
File Edit Format Run Options Windows Help 
favSubject=input ("What is your favourite subject?") _ ba 
if favSubject=="Computer Science" 

print ("That's my favourite subject too") 
else; 


U 


print ("Interesting, I prefer Computer Science") 
Figure 2.4 A syntax error in Python code 


Syntax errors 


Syntax errors, as we saw earlier (page 48), occur when the grammar rules of 
a programming language are not adhered to. 
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The Python program in Figure 2.4 is designed to ask the learner to enter their 
favourite subject. If they enter Computer Science the program should display 
‘That's my favourite subject too’, otherwise it should output ‘Interesting. | prefer 
Computer Science’. Unfortunately, a syntax error prevents the program from 
executing. The rules of the language state that a colon must be placed at the 
end of a line before an indented block and this is missing from the second line. 


Syntax errors are normally identified when the algorithm is being implemented 


in a high-level programming language because they prevent the code from 
being translated. 


Runtime errors 


Runtime errors occur during program execution and are the most difficult to Runtime error: an error that 
predict and spot. occurs while the program is 


This algorithm is designed to take two numbers, divide the first number by fe aes DEcrelon we 
computer is asked to do is 


the second number and output the result. When implemented in a high-level 

aa , : impossible to execute. 
programming language it will work as intended at least some of the time. : 
However, if the user entered 5 and 0, a runtime error would occur because it is 

impossible for the computer to divide 5 by 0. 


RECEIVE firstNumber FROM (INTEGER) KEYBOARD 
RECEIVE secondNumber FROM (INTEGER) KEYBOARD | 
SET result TO firstNumber / secondNumber 

SEND result TO DISPLAY | 


Figure 2.5 shows another example of a runtime error. A message is shown 
when letters are entered into the program where integers are expected. 


L@ Python 2.41 Shea = o x 
File Edet Shell Debug Optons Windows Help : o 
Python 3.4.1 (v3.4.1:c0e3lie010fc, May 18 2014, 10:45:13) [MSC v.1600 64 bit (aM <1 
D64))] on win32 
Type “copyright”, “credits” or "license()" for more information. 
>>> sssssssssesessssssssssssss=sesa= RESTART =sss=sssssssssssscasasessse=sss5 
a> 
Enter the first number: a 
Traceback (most recent call last): 

File "C:/Python/examplel.py", line 1, in <module> 

Numl=int (input ("Enter the first number: ")) 

ValueError: invalid literal for int() with base 10: '‘'a' 
>>> | 


Ln: 10 Col: 4 


Figure 2.5 An example of a runtime error in Python 


Error summary 
This table summarises the three types of error you are likely to encounter. 


| Logic | The program seems to run normally; however, there is an error in the logic of the program, which 
| | means it does not produce the result you expect. 

Syntax Syntax refers to the rules of the programming language. A syntax error means that part of the 
| ed. code breaks the rules of the language, which stops it running. 

Runtime An error that occurs when the computer tries to run code that it cannot execute. 
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Integrated Development 
Environment (IDE): a package 
that helps programmers to 
develop program code. It 

has a number of useful tools, 
including a source code editor 
and a debugger. One of the most 
useful features of an IDE is the 
debugger. One of its tasks is to 
flag up syntax errors in the code 
and issue helpful error messages. 


| 


Here are some error messages that you are likely to see in popular IDEs. 
IDLE (A popular Python IDE) 


RECEIVE width FROM (INTEGER) KEYBOARD 
RECEIVE height FROM ( INTEGER) KEYBOARD 
SET area TO width / height 

SEND ‘The area is: ‘ & area TO DISPLAY 


1 Givea value for the variable height that would cause a runtime error. 
Implement this algorithm in the high-level programming language 
you are studying, correcting the error you identified in Question 1 and 
using validation to prevent a runtime error occurring. 


SET subjects TO [’English’, ‘Maths’, ‘Science’, 
‘Computer Science’ ] 
FOR index FROM 0 TO 4 DO 


SEND subjects{index] TO DISPLAY | 
END FOR | 
I 


Identify the runtime error that will occur when this algorithm is 
implemented in a high-level programming language and executed. 


Using an Integrated Development Environment (IDE) 


You probably already have first-hand experience of using an Integrated 
Development Environment (IDE) when writing code. It’s definitely worth 
taking some time to get to know the IDE that comes with the language you 
are using. Useful features such as syntax highlighting, code auto complete 
and auto indent will help to make your programming experience — especially 
at the outset — far less stressful. 


It is really important that you get lots of practice interpreting error 
messages and fixing errors in your code. 


* SyntaxError: invalid syntax — part of the code breaks one of the rules of the programming language 
¢ IndentationError: expected an indented block - statements after a colon must be indented 

* TypeError: Can't convert ‘int’ object to str implicitly — trying to join a string and an integer together 

¢ ZeroDivisionError: integer division or modulo by zero — trying to divide a value by 0 


NameError: name is not defined - referring to a variable or subprogram that does not exist 
Eclipse (A Popular Java IDE) 


* ; expected — each statement should end with a semicolon 
¢ Cannot find symbol - referring to a variable or subprogram that does not exist 


. Incom patible types - trying to mix data of different types, for example a string and an integer 
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a 


file Edrt Format Rur Options Windows Heip 

Numl=int (input ("Enter the first number: ")) sh 
Num2=int (input("Enter the second number: ")) 

Sum=Numi +Num2 


‘ ‘2 Seeatanbress x 
pete ( zzz es 


x] EOL while scanning string literal 


| <_ «| ’ 
Figure 2.6 A syntax error flagged up by an IDE in Python code 


The IDE has flagged up a syntax error in this program. 


It has identified that the programmer has started a string but has not 
indicated where it finishes. The rules of the language state that strings must 
be enclosed in speech marks. 


The test plan 

At the start of a programming project it is absolutely crucial to make a list 
of the requirements that the program is expected to meet. Throughout the 
development phase of the project you should regularly refer back to this list 
of requirements to check that you are on track to achieve them. 


Deciding how to test the finished program to make sure that it fully meets 
the requirements can’t be left to the last moment either. As part of the 
design stage, you should create a test plan listing the tests you will carry 
out, the data that will be used for each test and the expected result. 


RECEIVE examMark FROM (INTEGER) KEYBOARD 


IF examMark >= 80 THEN 
SEND “A’ "TO DISPLAY 


jg6,S)8, 
IF examMark >= 70 THEN 
SEND ‘B’ TO DISPLAY 
ELSE 
IF examMark >= 60 THEN 
SEND ‘C’ TO DISPLAY 
BSE 
IF examMark > 0 THEN 
SEND 'D' TO DISPLAY 


ELSE 
SEND 'FAIL' TO DISPLAY 


END IF 
END IF 
END IF | 
END IF 


This algorithm converts an exam mark into a grade. It should only accept 
marks between 0 and 100. 
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The test plan extract below shows some of the tests that have been planned 
for the finished program to ensure that it meets all the requirements. 


To check correct conversion of valid 
| mark 


b = a re —, = arate * i. 
| To check correct conversion of 
boundary mark 


peel 


To check response to erroneous mark | 


0 ‘FAIL 

oP) iD; 

65 ic 

13 | 'B 

85 A 

0 | FAIL 

ee D 

60, 69 i 

TAO RES, 'B 
/ 80,100 | A 

ao | Error message 
| 105 | Error message | 


Only the first four columns of the test plan table can be completed at the design 
stage. The remaining columns are filled in when the program is complete. 


It’s unlikely that you'll be able to anticipate at the outset every test that will 
be needed to ensure the program works as intended. The test plan is not a 

fixed document. If additional tests are required they should be added to the 
test plan. 


lf a test produces the expected result you can simply write ‘None’ in the 
‘Action needed/comments’ column. If, however, that is not the case then you 
should instead note what went wrong and what you did to put it right. 


It is important to select suitable test data for the tests. Test data falls into 
three different categories: normal, boundary and erroneous. (See also 
validation tests on page 71 in this chapter.) 


Normal | Data that is well within the 
data | limits of what should be 
_ accepted by the program. 
Boundary Data that is at the outer 
data | limits of what should be 
| accepted by the program. 
| Erroneous Data that should not be 
Table 2.6 Test data categories | data | accepted by the program 


Test 1 uses normal data to check 
if marks are converted into grades 
correctly (e.g. a mark of 65 should 
be converted to a grade C). 


Test 2 uses boundary data to check 
that the program works correctly 
with marks at the upper and lower 
boundaries (e.g. a mark of 60 anda 
_ mark of 69 should both convert toa 
_ grade (). 


Test 3 uses erroneous data to check 
_ that the program does not accept it. 


Testing is just as important as writing code because it ensures that the 
finished program works correctly and fully meets the requirements. You 
should use a ‘bottom up’ approach to testing (i.e. test each subprogram as 
you develop it and then test the whole program once it is finished). 
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Here is the completed test plan for the grade calculator program. As you can see, the expected result was not 
produced when the program was tested with erroneous data. A range check had to be added to the program to ensure 
that only marks between 0 and 100 can be entered. 


ri To check correct 
conversion of valid 
score 
Z To check correct 


conversion of 
boundary score 


g To check response to 


erroneous score 


| 


FAIL "FAIL 
D D’ 
a C 
B iB 
‘A ny 
FAIL! FAIL 
By | 'D' 
ay | ie 
iB B 
1A oN 
Errormessage | ‘FAIL’ 
Error message | ‘A’ 


| None 


None 


| Arange check has been added — 
| to ensure that only valid marks | 
| can be added. | 


After you have carried out all of the tests and made all the necessary 
changes, the program should be retested to ensure that the improvements 
you have made haven't introduced any new errors into the program. 


Evaluating programs 


You need to be able to identify the strengths and weaknesses of your own 
programs as well as those created by other programmers. This will enable you 
to identify techniques that work well and aspects that could be improved. 


i 
Usability — is the | 
program easy to use? | 


The efficiency of 
the program — have 
loops and subprograms 
been used to avoid 
unnecessary code 
repetition? 


Requirements ~ 
does the program meet 
all the requirements 
set at the start of 
the project? 


5 


Evaluating programs 


| 
| : = 
: Validation — has 


| appropriate validation 
| been applied to user input? 


Code readability — 
have meaningful 
identifiers, comments 
and white space been | 
used appropriately? | 


Se 


Figure 2.7 Consider the aspects shown above when evaluating a program. 
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1 Design an algorithm for a 
simple guessing game. The 
algorithm must: 


a generate a random 
number between 1 and 6; 


b ask the user to input a 
number between 1 and 6; 


c reject any values outside 
the acceptable range; 


d_ display ‘Well Done’ if the 
user guesses correctly or 
‘Try Again’ if their guess is 
incorrect. 
2 Design a test plan for this 
guessing game program. 


3 Write the guessing game 
program in the high-level 
programming language you 
are studying. 


4 Test your program by 
carrying out the tests 
you planned. Ensure you 
update your test plan after 
completing each test. 


sia | 


This question tests your ability to 
trace an algorithm using a trace 
table. Remember each variable | | sev count TO 0 
change should be recorded 
and each time a line of code 


SET scores TO (45, 67, 34, 98, 52] 
SET length TO LENGTH(scores ) 


FOR index FROM 0 TO length — 1 DO 
IF scores[index] >= 50 THEN 


alters the value of a variable or | SET count TO count + 1 | | 
variables a new row of the trace | END IF 
table should be completed. | END FOR 


1 Describe what this algorithm does. (2 marks) 
2 Draw and complete a trace table for this algorithm with these column 


headings: 

a_ length 

b count 

cindex 

ds scores[index] (5 marks) 


¢ Logic errors occur when there is an error in the logic of the code, causing the program to produce an 
unexpected result. 

¢ A syntax error occurs when part of the code breaks the rules of the programming language. 

¢ Aruntime error occurs while the program is running and it is asked to do something that is impossible to do. 

¢ Trace tables can be used to manually trace the execution of an algorithm, allowing you to track the changes in 
variable values. 
Before creating a program it is important to produce a test plan that outlines how the final program will be 
tested to ensure it meets the requirements. 
Evaluating a program involves considering its strengths and weaknesses and identifying areas for improvement. 


$1 Describe the three types of error associated with program development and identify the stage(s) of 
development at which they are most likely to occur. 


S2 Describe the function of a trace table. 

$3 Describe the features of an IDE that help programmers write error-free code. 

S4 Describe the function of a test plan in program development. 

$5 Describe what is meant by normal, boundary and erroneous data. 

S6 Explain why it might be necessary to retest a program once all the planned tests are completed. 


Challenge == _ i) 
C1 Identify the requirements for a program, design a solution and draw up a test plan for it. 


C2 Design and implement the program. Conduct your planned tests and make any necessary changes to your 
program, ensuring your test plan is kept up to date. 


C3 Evaluate your program. 


How confident do you feel about your answers to these questions? If you're not 
sure you answered them well, try completing the activities on pages 84, 86 and 89. 
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3.1 Binary 


Learning outcomes he. 


By the end of this section you should be able to: 
° explain why computers use binary to represent data and program instructions. 


* describe how computers represent and manipulate numbers. 


* convert between binary and denary whole numbers (0-255). 
* perform binary arithmetic and understand the concept of overflow. 
explain why hexadecimal notation is used and be able to convert between hexadecimal and binary. 


Binary: information represented | 


by only two values (e.g. a voltage 
or no voltage, on or off). 


| 


There are no communication errors 
or misunderstandings because 
there are no small differences. 


Digital: information represented 
by certain fixed values (e.g. 
high, medium or low). Any signal 
between these values would be 
meaningless and not used. 


Sending and receiving 
mechanisms do not have to 
be as accurate as for analogue 
communication. 


Analogue: using signals or 
information represented by a 
quantity (e.g. an electric voltage 
or current) that is continuously 
variable. Changes in the 
information being represented are 
indicated by changes in voltage. 


This method requires very accurate 
sending and receiving mechanisms. 


Why binary? 
Binary is needed to represent data and program instructions because of the 
way in which computers work. 


The processor, which processes all of the data and instructions, contains 
billions of transistors which are connected together to form circuits. 


The transistors act as switches, similar to light switches. They have only two 
states, on or off — they either transmit an electric current or they do not. A 
system with separate states is said to be digital and if there are two states 
it is binary. There are no in-between states with different levels of current 
as there would be in a dimmer switch, which produces different levels of 
brightness in a bulb. A system such as this, where there is a continuous 
range between two values, is said to be analogue. 


Humans have digital systems. The nerves which control all of our actions 
are a digital system. They transmit nerve impulses — bursts of electricity 
from the sense organs to the brain or from the brain to the muscles. 
These impulses are all of the same intensity. These impulses are all of the 
same intensity and the strength of the stimulus causing the impulses is 
not signalled by the size of the impulses but by their frequency, 


As there are only two states, on or off, the states are represented by the 
digits of the binary number system, 1 and 0. 


All of the data and program instructions processed by a computer are 
nothing more than streams of millions of 1s and Os. 


Numbers, text, graphics and sound are all represented in the same Way, as a 


series of 1s and Os. The program instructions that the processor is following 
allow it to interpret them in different ways. 


Representing information 


There are only two digits, 1 and 0, but to represent the symbols in text there 
must be at least 52 separate items of information - 26 lower-case and 26 
upper-case letters. Then there are punctuation symbols — the full stops, 
question marks, etc. 


When we write, we combine the letters in our alphabet to create words and 


combine these into sentences to convey meaning. \ Rey errsramaaiae amas“ 


In a similar way the two binary digits (bits) can be combined into groups. | Binary digit (bits): the smallest 

unit of data that is represented in 
a computer. It has a single binary 
value, either 1 or 0. 


A single bit offers two possibilities. In a graphics program it could give 

black (1) or white (0), for example. If we wanted colour we’d need more 
combinations. If two bits are used this gives four possible combinations, 00, 
01, 10, 11, each of which can represent a colour, so four colours could be used. 


Three bits would give eight combinations, 000, 001, 010, 011, 100, 101, 110, 
111, allowing eight different colours to be represented. 


With four bits there would be 16 combinations. 


There is obviously a mathematical relationship between the number of bits 
used and the number of possible colours. 


e With 1 bit there are 2 possible colours. 

¢ With 2 bits there are 4 possible colours. 
« With 3 bits there are 8 possible colours. 
e With 4 bits there are 16 possible colours. 


The number of possible colours is equal to 2 to the power of the number of 
bits used in the combination. 


A ald 

© 2=272=4 
oe CaS 
oe ce 


Number systems 
Binary is a number system based on two digits, 0 and 1. 


We are more used to the denary system which has ten digits, 0 to 9, but the 
binary system functions in the same way. 


The difference is that the denary system works in powers of 10, whereas the 


binary system uses powers of 2. 


Denary system 
Every digit has a place value and the one to the left has a value 10 times 
higher than the one to the right. 
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Byte: the basic combination of 

| bits used to represent an item 
of information. A byte typically 
consists of 8 bits. 


Activity 3 ? 


Calculate the denary equivalents 
of the following binary numbers. 


a 00111001 
b 11000110 
c 10101010 


The following table shows the place values for the digits in the number 3,639. 


Place values | los | 10° 10° 10° 
1000 100 10 1 
3 6 3 ) 


Therefore 3,639 can be written as: 
(3 * 1000) + (6 * 100) + @ * 10) + (9* 1) 
Binary system 
The binary system has similar place values but they increase by powers OFZ. 
The following table shows the place values for the digits in a byte. 
Place values 2’ 2° a 2% 2 2: a 2° 
128 ; 64 32 16 8 4 Z 1 


Converting binary to denary 
The table above can be used to convert a binary number to a denary one. 


The denary equivaient of the binary number 10101101 can be calculated as: 
Place values _ oe rt 24 2 2? 21 2° 
128 64 32 16 8 4 2 | 
il 0 1 0) ] 1 0 1 


(1 * 128) + (1 * 32) + (1 * 8) + (1 * 4) + (1 * 1) = 128+ 32+84+44+1=173. 


Converting denary to binary 
The table of place values can be used to convert denary numbers to binary. 


Convert denary 213 to binary. Use the table and fill it in as you calculate 
the equivalent. 


Placevalues 27 26 | 25 | 2* | 2) 2? gt 
128/64 |32 |16 |e ae ota 


Move from left to right: 


Step 1: Is the number (213) equal to or greater than 128? If it is place a ‘1’ 
at this place value and if not place a ‘0’ 


In this case 213 is greater than 128 so place a ‘1’ in this column and 
calculate the remainder. 


Place values 2’ 2 a Z 23 2? 21 20 
128 64 32 16 8 4 2 1 


The remainder is 85. 
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Step 2: Is 85 greater than or equal to 64? Yes, therefore place a ‘1’ in this column. 
Place values | 2’ 


Ee a Ea eee 
1128 |64 (32 |16 |8 |4 |2 [4 
ae 


The remainder is 21. 


Step 3: Is 21 greater than or equal to 32. No, therefore place a ‘0’ in this column. 


Piace values} 2’ 0p 2° (2° [2% | 25 | 22 |at | 2° 
12s [64 (32 |16 |8 [4 [2 |1 
od EE: 


The remainder is still 21. 


Step 4: ls 21 equal to or greater than 16? Yes, therefore place a‘1’ in this column. 


Place values 2’ 2° | 


2 2 | 2 
128 64 ‘32 |16 | 

to 17 bo |a 
The remainder is 5. 


Coe and Pre 
8 14 2 1 


Step 5: Is 5 equal to or greater than 8? No, therefore place a ‘0’ in this column 
Pricevalues} 2’ 42° f2° 12°. | 2 | 2° | 2; | ra 

(128164 132 |/16 18 |4 |2 |4 
] 1 0 1 


The remainder is still 5. 


Step 6: Is 5 equal to or greater than 4. Yes, therefore place a ‘1’ in this column 


Macevalues}2’ j2° j2 |2& |2 2 |2 |2 
Mele, (32 [16 j8 (4 | 


The remainder is 1. 


Step 7: Is 1 equal to or greater than 2? No, therefore place a ‘0’ in this column 
Place values | 2? 12° . | = 26 | 23 | 22 | 21 | 20 
128 64 32 16 


is |4 [2 |a 
The remainder is still 1. 


Step 8: Is 1 equal to or greater than 1? Obviously, they are equal and 
therefore place a‘1’ in the last column. 

Mince vargas) 27 ae 2 aliases 2? ee 

128 |64 | 32 146 | | 

Pe Oe 

Therefore denary 213 is represented by binary 11010101. 
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Calculate the binary equivalents 


of the following denary numbers. 


a 69 
'b 193 


| ¢ 239 


Extend your knowledge ». 


The methods to convert binary to denary and vice versa can be performed 

by computer programs. Write and test programs that will carry out these 

conversions. The programs should ask a user to enter either a denary or 

an eight-bit binary number. 

Remember to validate their input: 

¢ the denary number should be no larger than 255, and 

¢ the binary number should have eight digits and each digit must be 
either 1 or 0. 


Binary arithmetic 
Binary numbers can be manipulated in the same way as denary ones. 


Denary addition 
When addition is performed in denary, a ‘carry over’ is used if the result is 
greater than 9. 


369 + 733 

Place iit Osaaa 7 | A i ae 

values | 1000 

a 4 ae wes 3 a‘ 

(otal | 2 a 0 2 
100 + 300+ 700=1100 . 10+60+30=100 Oo + iewl2 
The 1000 is carried over | The 100 is carried over. The 10 is carried over 
leaving 100. | leaving 2. 


Binary addition works in the same way but a ‘carry over’ is needed if the 
result is greater than 1. 


Binary addition 


11010110 + 01100111 
Place Le AD: mee 2? ese 2 21 a9 
values =‘: 128 | 32 | - | 


i aa E 7 1 me 1 10 = 1 0 
—_ ee 0 0 ii Pix 1 

Total | 0 0 A fd ote” ac 
128+128=256 | 64+64=128 |No | No |No | 4+44+4=12 1242-4 No 
This should be | The 128 is carry =| carry carry =| The 8 is The 4is Carry 
carried over. carried over, . OVEr over over carried over | Carried over 


needed. needed. | needed. | leaving one 4. | over | needed. 
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We are adding eight-bit numbers and this has caused a problem. All eight 
bits have been used and the 1 that was carried over in the last column 
has nowhere to go — it has been carried out. Therefore the result of the 
calculation would be wrong. This is called an overflow error. 


This condition occurs when a calculation produces a result that is greater 
than the computer can deal with or store. 


When this occurs, the processor is informed that an error has occurred. 


Extend your knowledge hh 


Overflow errors have led to many real-life disasters. 

The most famous, or infamous, was the crash of the Ariane 5 space rocket, 
which exploded 40 seconds after its launch in 1996. 

Others include the failure of the Patriot missile defence system in 1991. 


Carry out research to find out more details about these ‘overflow’ 
disasters and problems. 


Signed and unsigned numbers 

So far we have been considering positive whole numbers or integers, but 
how can we represent negative numbers? When an integer is indicated as 
being positive or negative it is described as being signed. 


In denary, this is simple, a ‘+’ or a ‘—’ symbol is placed in front of the number. 
In binary, if we are using 8-bit numbers, then we cannot do this because we 
cannot simply add an extra bit. Also the processor would not understand the 
‘+’ and ‘—’ symbols. 


Sign and magnitude 

In a multiple-bit binary number, the left-most bit, the one with the greatest 
value, is called the most significant bit (MSB). We can use this to represent 
signed integers. 


It is set to 1 for negative and 0 for positive. 


Therefore, using this method, binary 00011100 represents denary +28 and 
binary 10011100 represents -28 


The most significant bit is being used for the sign, so the largest positive 
number represented by a byte is 127. 


Adding signed integers 
If the binary equivalents of denary +28 and denary ~28 are added we get the 
following result. 


A eis 1 0 a 0 
i Fe jp ee ee 
a ass 0 0 0 


OF 


Carry out the following binary 
additions. 


| a 10011010+ 11010111 | 
| b 00001101 + 10101010 | 
_¢ 11010111 + 10001010 ; 


Integer: a whole number (e.g. 3, 
6, 9). 

| Most significant bit (MSB): the 
bit with the highest value in a 
multiple-bit binary number. 


The most significant bit 
represents a negative number 
rather than a sign. 


Adding denary +28 and -28 we expect the result to be 0, but the binary 
result is equivalent to -56. 


Also using this method zero could be both positive and negative. 


Positivezero| 0 0 | O | O , O; Oj O | 0 
Negative 1 OG) 20 0 a OS ae 
Zero 


That clearly doesn’t make sense and so another method is needed to 
represent negative numbers in calculations. 


Two’s complement 
The most common method used to represent signed integers in modern 
computers is two’s complement. 


This method works on the principle above - the result of any number added 
to its negative equivalent should be zero. 
Finding the two’s complement of a binary number 


To find the two’s complement of a binary number 


¢ flip all of the bits - change 1s to Os and Os to 1s. 
¢ add 1 to the result. 


Number 0. 0. 0 | et 0 0 

flip | 2 | 2 | 1 | © | Og 0 1 

Add 1 to 1 iol ran sa0 0 0 0 
flipped value 


The MSB is 1, indicating a negative number. 


Therefore the two’s complement of binary 00011100 is binary 11100100, 
which is equivalent to denary -28. 


Converting two's complement numbers back into denary 


This example shows that the two's complement binary number 11100100 
represents denary ~28. We can check whether this statement is true by 
converting it back from two’s complement. 


Method 1 - Reverse calculation 


The positive equivalent of a negative integer in two’s complement format 
can be found by flipping the bits and adding 1. 


Number Vien mI) <0. 7,0 | UG 


roel 


-28 
ip § 0) 0,0 1 1. | Guin eee 
Add 1 to oO; 0 | o | 1 | aoe ee 

flipped value | 


The result is denary 28. 
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This shows that 11100100 (—28) is the two’s complement representation of 
00011100 (+28). 


Method 2 - Place values 


Using place values the two's complement number can be converted directly 
into denary. 


Place 2’ 2° 2 2° 2s 2? 2) 1 po 
Wales WENDReNIEA 32 #216 «#8 & ors ia 
1 1 1 0 0 1 :0 0 


The most significant bit of the two’s complement binary number is 1, 
indicating that it is negative. Therefore, the denary equivalent of the MSB is 
taken as being negative -128. 


The denary equivalent will be 
-128 + 64+ 32+4=-128+100=-28 


This shows that two’s complement 11100100 is equivalent to denary -28. 


Convert -69 to an eight-bit two’s complement binary number. 


Pc. i ae ae + ee, aan. oa 


values 128 64 32 16 8 4 ree | 2 
69 0 1 0 0 0 1 0 vi 
Flip 1 0 1 . a 0 — 
Add 1 1 0 1 1 1 0 1 1 


So denary -69 is 10111011 in two's complement binary. 
We can check this using the place value method. 


-128 + 324+ 16+8+2+1=-128 + 59 =-69 


Adding -28 to +28 
The two’s complement of 00011100 is 11100100 


So adding the binary equivalents of denary +28 and —28 gives 


28 a 0 0 cee re eeu! 0 
-28 es oa le al | as aT 6 
Result of addition 0 0 0 ceca 0. | s0 


There is an overflow, but this time the result is 0. 


og 


Additions using two's 
complement can be used for 
carrying out subtraction in binary. 
For example, 


28-5 
is the same as 
28 + (-5) 


Extend your knowledge [&\ 


Write and test a program in the 
language you are using which will 
calculate the two’s complement 
of an 8-bit binary number. 


Remember to validate the binary 
number entered. 


Add 28 and -5. 
This is more difficult. The result should be 23. 


! 


| 


Activity b 


The two’s complement representation of —5 is 


s- |e | Or 20), SO, Oa I 
Flic’) 1 | ae) Tete 
eAddayy 0 | 1 eo 


Adding binary 28 to binary —5 gives 


Oe 08 ho "AG eo 1 1 
1 iad 1 0 
Oona «Oo. . 20 1 0 1 


The result is 00010111, which is the binary equivalent of 23. 


Add —28 to +5. 
This is slightly more difficult. The result should be —23. 


-28intwos: 1 / 1: 1 1 0 0 1 
complement 


5) fOr Or CO atin Or + ea. 
Result | 1 | 1 Pee 0 At) 


We can show that this is equal to -23. 


28 164432484 l=—128 + 105 =—23 


1 Calculate the two's complement representation of the 
following denary numbers. 


a -113 
b -56 
c -90 


2 Showing your working, carry out the binary addition of 
90 + -33. 
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Binary shifts 


In denary when we want to multiply a number by ten we move each number 
one place value to its left and add a 0 to the end. 


For example, 13 * 10% 130. 


Place values 10? =3=110? «110! 10° 
1000 100 10 1 

Original number ! | | 1 | 3 

Multiply by 10 ~ the digits are | | | 3 | 0 


moved to the left and a 0 is added. 


To divide by 10 we do the reverse - move each number one place value to 
its right. 


A left shift is used for multiplication by 10 and a right shift for division by 10. 


If the number is being multiplied by 100 (102), then each digit is moved two 
places to its left. 


In binary left and right binary shifts can be used for multiplication and 


division by powers of 2. 


Logical shifts 
For unsigned numbers logical shifts are performed in the same way as in 
denary. 


If we multiply the unsigned binary number 00010100 (20) by 2? (4) there 
should be two shifts to the left. 


The left-most bits drop off the end and are replaced by Os at the right. 


Pimeeeyatuess 2’ 2 ee 
128 64 32 16 8 ih 2 


Result of shift 0 1 0 1 0 | 0 0 
The product is (64 * 1) + (16 * 1) = 80 in denary as expected 


If we divide the same number by 4 (22) there would be two shifts to the 
right. This time the right-most bits drop off the end and are replaced by Os 
at the left. 


Placevalues 2’ 2° . Pag . eo . fe . 2° | Zz) | 2° 
128 64 32 16 8 re 2 
0 0 @) ] 6) ] 0 0) 


Result of shift . 0 10 0 0 0 (ea 0 


The result is (4 * 1) + (1 * 1) = 5 in denary as expected. 
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Binary shift: an operation done 
on all the bits of a binary value 
in which they are moved by a 

| specific number of places to 

| either the left or right. 


Activity 5 


Calculate the results of the 
following shifts on unsigned 
integers. 


a 00111010* 2°. 
b 1061110122 


Arithmetic shift: used for 
signed binary numbers. When 
, performing a right shift the bits 
| at the left are replaced by copies 
of the most significant bit. 


A logical shift on integers can lead to a loss of precision. For example, if we 
divide 00100001 (33 in denary) by 2 the digits would be shifted one place to 
the right. 

Place values 2” 26 Bs = eee 2 7 | 2 2° 
128 64 32 16 8 4 Z 1 
fo Ot tome Omg 0. eameer lea 

Resultofshift 0 |0 i090 ‘1 ‘0 Oo [0 0 

The result is 00010000, which is 16, but the correct result should be 16.5. 


Arithmetic shifts 
Arithmetic shifts are used with signed numbers expressed in two's 
complement format. 


Left arithmetic shift 

A left arithmetic shift is identical to a left logical shift except that the 
left-most bit (MSB) is not included because it must remain in place to 
indicate the sign. 


Calculate the product of -36 * 2 
-36 in two's complement format is: 


36 0 1 0 0 0 1 0 0 
Flip , i | 1 7; © 1 1 0 1 1 
addi 1}. 0 1 1 1 0 0 


Therefore -36 in two's complement format is 11011100. 


If this is multiplied by 2 there should be one shift to the left with a 0 
added at the right. 


Leaving the MSB in place and shifting the others to the left will produce 
10111000 
This should be equal to -72 in two's complement notation. 


We can see if this is correct by proving that 10111000 is the negative of 72 
in two's complement. 


wey ry oyi 1) th OF oes 
Flip | O 1 | 0 C0 fee 
Add1) 0 1 | 0 0 (Se cem Bex. 0 


01001000 in binary is equal to 72 in denary. 
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Right arithmetic shift 


When dividing signed binary numbers in two’s complement format by 
powers of 2, the bits are shifted to the right, but they are replaced at the left 
by copies of the most significant bit. 


Calculate -72 / 2’. That is -72 divided by 4. The result should be -18. 


—72 in two's complement format is 10111000 — we showed this in the 
previous worked example. 


Performing two right arithmetic shifts and adding the MSB at the left will 
give 


LG. 0 
This should be ~-18 in two’s complement notation. 


We can test it as before. 


-18 1 i 1 0 1 il 0 
Flip 0 0 0 1 0 Qa al iia 
Add1 0 0 0 1 OiscienO Maat 0 


The result is 00010010, which is equivalent to (1 * 16) + (1 * 2) = 18. 


Hexadecimal numbers 


So far only 8-bit binary numbers have been considered, but it is common for 
data and commands to be represented by 32 and 64 bits. 


It is difficult for humans to remember and manipulate 8-bit binary numbers, 
but almost impossible to use 32-bit ones. 


Humans get confused with large binary numbers, so we use hexadecimal 
numbers to represent them. Computers never use them. 


The hexadecimal system uses 16 digits from 0 to 15. 


Our denary system has only digits for 0 to 9, so the higher hexadecimal 
digits are represented by upper case letters. 


Omens eee As 15 ee Ge 8 9 TOD Eee 
Get 2 3 GES ¢ BOP AREIC D FE 


The following diagram shows how to convert binary to hexadecimal. 
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| 1 Show the following 


division: | 
a 10010000 / 22 in binary 
b 11110110 / 21 in binary 
11000000 / 23 in binary | 


| C 


| 2  Showthat your answers are 
the expected results. | 
Sacto “ — 


A common use of hexadecimal 
numbers is to describe colours 
used in computer graphics. Each 
of the colours has a value of 

24 bits — 8 bits for each of the 
red, green and blue elements 
that make it up. The 24 bits are 
represented in hexadecimal so 
that they are easier for humans 
to use. 


For example, it is far easier to 
set the colour to red by entering 
#FFOOOO than having to type 
111111110000000000000000. 


A # symbol is used to indicate a 
hexadecimal number. 


Hexadecimal: a base-16 number 
system. There are 16 digits and 
the place values increase in 
powers of 16. 


Activity 7 

Convert the following 
binary numbers to 
hexadecimal. 

a 01101110 

b 10011100 

c 00101010 


Convert the following 
hexadecimal numbers 
to binary. 

a A6 

b 9C 

e-7D 


Converting binary to hexadecimal 


‘ 


[1] 1]1]0| Nibbles fo} a] 1 


Here is a byte (8 bits) 


place values. 


These are the denary 
equivalents. 


representation of that 
denary value. 


Figure 3.1 Converting binary numbers to hexadecimal 


It is far easier to say and remember E6 than 11100110. 


Converting hexadecimal to binary 
This is the reverse of the method used above. 


12 | 


ad , 


1 a Add together the following 8-bit numbers. (1 mark) 


Crle Cai muO Ged 
i OO ia 


b Identify the problem that this addition has created. (1 mark) 
Carry out a 3-place logical right shift on the following binary number. (2 marks) 


10010011 


First convert each digit to denary. 


Now convert each denary number 
into 4-bit binary numbers. 


Figure 3.2 Converting hexadecimals to binary numbers 


Explain the effect of performing a right shift on a binary number. (1 mark) 


Describe the steps needed to convert the binary number 11101110 into a hexadecimal one and show 


the result. (4 marks) 
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To convert the binary to 
hexadecimal the byte is split 
into two halves of 4 bits each. 
These are called nibbles. 


Each nibble is now converted 


into its denary number. Here 
are the nibbles with their 


This is the hexadecimal 


1 a This is a straightforward question. Carry out the calculation and 
write the result in the space provided. Remember to carry over if the 
addition of each pair of digits is greater than 1. 


b This just requires a one or two word answer. 
2 a Again carry out the shift and write the result. 


b Here you have to explain what effect the shift will have. You could 
say that it is equivalent to multiplying the number by... 


¢ A longer answer is required. You should show stage by stage how 
the conversion is carried out. For example, you could start by saying 
what the 8-bit binary number is divided into. You should set out the 
explanation clearly and it could be in the form of a diagram. 


¢ In acomputer all data and programming instructions are represented by streams of 1s and Os. 
e Place values in the binary system increase by powers of 2. 


e An overflow error occurs when a computer attempts to handle a number that is too large (e.g. a number 
requiring 9 bits instead of 8 in an 8-bit processor). 


¢ To multiply a binary number by powers of 2, you can use left shifts. 

¢ To divide a binary number by powers of 2, you can use right shifts. 

* Hexadecimal is used to represent binary numbers because it is easier for humans to remember and use. 
¢ Binary, denary and hexadecimal numbers can be converted from one to the other. 


$1 Carry out the following addition: 11011001 + 10010010. 
S2 Perform the following calculation in binary: 25 + (-13). 
$3 Multiply the unsigned integer 01001001 by 2°. 

$4 Convert 11001101 to denary and hexadecimal. 


C1 Explain in your own words why binary is used to represent data and program instructions in computer systems. 
C2 Explain the difference between a logical right shift and an arithmetic right shift. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try the following activities again: 


° For S1 try Activity 3. 
e For S2 try Activity 4. 
¢ For S3 try Activity 5. 
° For S4 try Activity 7. 
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3.2 Data representation 


Leaming outcomes » 


By the end of this section you should be able to: 
e explain how computers encode characters using ASCII. 


¢ explain how bitmap images are represented in binary. 
¢ explain how sound is represented in binary. 
* describe the limitations of binary representation of data when constrained by the number of available bits. 


e calculate the file sizes of stored data. 


Representation of text 
When you press a key on the keyboard a stream of 1s and Os are input to 
the processor. The computer cannot recognise any characters such as ©’, 
‘@’ or ‘D’, only a combination of 1s and Os. The defined list of characters 
Keytem recognised by a computer's hardware and software is known as its 
| Character set: the defined | character set. 
list of characters recognised | 


by a computer's hardware and 


software. | 


| Obviously, an industry standard is needed or data will not be 
interchangeable between computers from different manufacturers. 


This standard is provided by the American Standard Code for Information 
Interchange, or ASCII code. 


Long before ASCII another standard binary code was invented to 
represent text. This was the Morse code, which uses lights, tones or clicks 
that are either of short (‘dots’) or long (‘dashes’) duration. The duration of 
a dash is three times the duration of a dot. 


Each character is represented by a sequence of dots and dashes. For 
example, the letter ‘C’ is represented by ‘dash dot dash dot’. 


Originally the ASCII code consisted of 7 bits and so 128 characters could 
be represented, 


All of the lower and upper case English characters, punctuation marks 
and control actions such backspace, shift on, shift off and carriage return 
were represented. 


Activity 9 > The printable characters of the 7-bit ASCII code are shown in Figure 3.3. The 


. — codes for control actions (mentioned above) are not shown. 
Using the table in Figure 3.3, 


write down the ASCII codes for If you look at the table you can see that the codes are grouped according 
the characters in the following: to function. Important groups are 65 to 90 for the upper case alphabetic 
characters and 97 to 122 for their lower case equivalents. The numeric 
characters 0 to 9 are represented by codes 48 to 57. 


ASCII code. 
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“en a re ‘slAmmetent <o ¢ |p| 8B 

32 00100000 « space | 57 00111001) 9 82 01010010 
33 00100001! | 58 00111010, : | 83 01010011 , 
34 00100010" 59 00111011; 84 | 01010100 
35 00100011 # 60 00111100 < | 85 01010101 
36 00100100 § 61) 00111101f = | 86 01010110 
37 00100101 % 62 00111110 > | | 87 01010111 
38 00100110 & 63 001} 2 | 88 | 01011000 
39 00100111 64 01000000 @ 89 01011001 
40 00101000 = ( 65 01000001 A 90 01011010 
41 00101001 +) 66 01000010 B 91 01011011 
42 00101010 * 67 01000011 «CSCO - 92 01011100 
43 00101011 + 68 01000100 D | 93 01011101 
44 00101100, 69 01000101: E 94 01011110 
45 00101101 - 70 01000110 «F 95 01011111 
46 00101110. 71 01000111 G 96 01100000 
47 00101111 / 72 01001000 =H | 97 | 01100001 
48 00110000 0 73 01001001 | 98 01100010 
49 00110001 1 74 01001010, J 99 01100011 | 
50 00110010 2 75 01001011 +=~«K 100 01100100 
51 00110011 3 76 01001100 iL 101 01100101 
52 00110100 4 77 01001101 M | 102 01100110 
53 00110101 ‘5 78 01001110 N 103 01100111 
54 00110110 6 79 01001111 O 104 01101000 
55 00110111 7 80 01010000 P 105 01101001 
56 00111000 8 81 01010001 Q 106 01101010 


Figure 3.3 The printable characters of the 7-bit ASCII code 


NU <x S3<— cea wu a> A 
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Di B | 
107, 01101011 | 
| 108 01101100. 
109 01101101 om 
110 01101110: on 
111 01101111) o 

p 

q 


— x An 


112 01110000 | 
113 , 01110001 | 

114 01110010 sr 
115 01110011 s 
116 01110100, t 
11701110101, u 
118 01110110. v 
119 01110111 w 
120 01111000 x 
121 01111001 
122 | 01111010 
123 | 01111011 | 
124 01111100 | 
125 01111101 

126 01111110 ~ 
122 011111111 | aEL 


| 
lb HIN 


KEY: D=denary 
B = binary 
C = character 


The groups of the characters can be important when you are writing a 
program. For example, if you are encrypting some text using a simple 
Caesar cipher shift key (see section 3.4, page 126), you can easily find the 
corresponding letters. 

For example, suppose the encryption key is +3 (a right shift) and you 
enter the letter ‘D’. The ASCII code for ‘D’ is 68 and so the encrypted value 
would be 71(G). If the encryption key is -3 (a left shift), then the plain text 
letter ‘D’ results in the cipher text letter ‘A’ (ASCII 65). 


Analysing a string 

To a computer a string is just a stream of binary codes that represent the 
characters. Programming languages have functions that will return the 
denary equivalents of those codes and vice versa. 


The Edexcel pseudo-code does not have these functions built in, but in the 
Python programming language they are ord() andchr(). 
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Encryption: the process of 
encoding a message into a form 
that only the intended recipient 
can decode, or decrypt, and read. 
The message is encoded using an 
agreed method or algorithm. This 
is called the key. The encrypted 
message is called a cipher. 


Therefore 


* ord(‘c’) would return 99, and 
¢ chr(100) would return d’. 


Using the ord() function the ASCII codes of the characters can be found 
when traversing a string. 


By inserting the ord( ) function into the Edexcel pseudo-code, the 
algorithm that does this can be displayed. 


SEND “Please enter the text” TO DISPLAY 
RECEIVE myString FROM (STRING) KEYBOARD 
FOR index FROM 0 TO LENGTH(myString) — 1 DO 


SET number TO ord(myString[index]) #The function 
is called using 
the character in 
the string as an 
argument. 


SEND number TO DISPLAY 
END FOR 


By using the chr() function you can build up a string using characters 
entered by their denary codes. 


The following algorithm will allow you to enter ten numbers which will be 
converted to characters and appended to the string. 


SEL mystring LO 7" #This will create an empty string. 
FOR index FROM 0 TO 10 DO 


SEND “Please enter a number in the range 65 to 90 or 
in the range 97 to 122” TO DISPLAY 


RECEIVE number FROM (INTEGER) KEYBOARD 

SET character TO chr(number) 

SET myString TO myString + character 
END FOR 


Creating the functions 


The pseudo-code does not have these functions built in. You need to define 
them using the commands and statements that are available so that you 
can call them when required. 


The ord() function needs to take the character as a parameter and return 
its denary code. 


You can do this in the following way. 
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FUNCTION ord(character) 
BEGIN FUNCTION 
SET arrayAscii TO [["!"][(33], ["°""][34],..[“z"][122]] #A two-dimensional array 
containing all of the 


characters with their denary 
codes. 


FOR index FROM 0 TO LENGTH(arrayAscii) — 1 DO 
IF character = arrayAscii[index, 0] THEN 
code = arrayAscii[index, 1] 
END IF 
END FOR 
RETURN code 
END FUNCTION 


| Create a function in pseudo-code that will return the character from the 
denary code entered. | 


Activity 10 7 — ' Extend your knowledge p&, 


Create an algorithm that will encrypt the following sentence: The original ASCII code with its 
96 printable characters is very 
restricting as many more are 


Encrypt it with a shift of 3 to the right (i.e. A should be encrypted as D). needed to cater for all the world’s 
languages. 


‘The ASCII code represents characters. 


(Remember: the spaces and the full stop should not be changed.) It was changed to an 8-bit code as 


Output the encrypted text. Extended ASCII, following which 
Unicode was developed. 
Present your algorithm as pseudo-code or code and test it in the Carry out research into these two 


programming language you are studying. systems. 


Representation of bitmap images 


Images are also represented and stored as binary code. Keyterm 


The basic unit of a bitmap image is the pixel. Pixel: short for ‘picture element’, 
the smallest single point of 
Image size and resolution colour in a graphic image. 


Image size is usually written as two numbers, for example 640 x 480, 
2240 x 1680 or 4064 x 2704. 


The first number indicates the number of pixels along the width of the 
image and the second is the number in the height. 


Thus a 640 x 480 image is made up of 307,200 pixels and a 4064 x 2704 
image comprises 10,989,056 pixels. 
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4288 x 2848 pixels 


Resolution: the number of 
pixels per inch when the image 
is displayed (for example, ona 
monitor or on paper). 


:= 


The following images are the same size but have different resolutions. 


100 x 66 pixels 


There are different numbers of pixels per square inch in the two images. 
Each pixel in the 100 x 66 image is a larger size than those of the 

4288 x 2848 image. The greater the number of pixels within a given area the 
higher the resolution and the more detail shown. An image of size 100 x 66, 
when displayed in the same area as one of 4288 x 2848 pixels, will have a far 
lower resolution as there are far fewer pixels. The lower the resolution the 
less the amount of detail seen. 


Encoding the pixel information 
Each pixel is represented by a ‘bit pattern’ - often the same pattern used to 
represent a character in the ASCII code. 


The number of different colours that can be used is determined by the 
number of bits used to encode colour. 


For example, if one bit is used for each pixel, then there will be only two 
colours — black and white. 


The following diagram shows an encoding system with one bit used to 
represent each pixel. 


Figure 3.4 1-bit encoding forming the letter ‘H’ 
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If 0 represents black and 1 white, the letter ‘H’ could be encoded with the 
following bit pattern. 


11011011 
1021011 
11011011 
11000011 
11000011 
11011011 
11011011 
11001011 


The complete image consists of 64 pixels and because 1 bit is used for each 
square, the size of the image is 64 bits or 8 bytes. 


As the colour depth increases so too does the number of colours that can 
be represented. 


If 1 bit is used then the number of colours is 2! or 2. 
If 3 bits are used the number of colours is 2? or 8. 
If 8 bits are used the number of colours is 28 or 256. 


The current standard represents the colour of each pixel in 24 bits. There are 
8 bits for each of the red, blue and green primary colours. 


There are therefore 256 variations of each primary colour contributing to 
the overall colour of the pixel and that gives 256 x 256 x 256 or 16,777,216 
different colours. 


This standard allows lifelike images, so it is often referred to as true colour. 
This shape is filled with a colour named Alice blue’. 


In binary you would have to enter 111100001111100011111111 every time 
you wanted to use it. 


However, as mentioned in section 3.1, hexadecimal comes to our rescue. 


Using hexadecimal you would only have to enter #FOF8FF. 


File sizes 

The file size for a bitmap image is calculated by finding the total number of 
pixels and multiplying that by the number of bits used to represent each 
pixel or 


Width x Height x Colour depth 
The file size of the left-hand image on page 110 is 
4288 (width) x 2848 (height) x 24 (bit colour depth) = 293,093,376 bits 


That is 36,636,672 bytes or almost 37 megabytes. 


iM 


Colour depth: the number of bits _ 
used to encode the colour of each 


pixel. 


As the size of the image and its 
colour depth increases so too 
does the amount of data that has 
to be stored and manipulated. 

If you were visiting a website 
with 4288 x 2848 24-bit images 
then, unless you had a very fast 
connection, you would quickly 
become frustrated as you waited 
for them all to download. Lower 
resolution images have to be 
used as a compromise between 
quality and file size. 


Activity 11 ? 

Calculate the file sizes of the 

following images and express the 

sizes in bits, bytes and megabytes. 

1 A256 colour image with a size 
of 640 x 480 pixels. 


2 Atrue colour image with a 
size of 640 x 480 pixels. 


Representation of sound 


All sounds are caused by vibrations. As objects such as our vocal cords or 
guitar strings vibrate backwards and forwards they push the air molecules 
alongside them, sending a wave of compressed molecules through the air. 
When these compression waves, or sound waves, reach our ears they set up 
vibrations in tiny sensory hairs in the inner ear. This sends nerve impulses to 


the brain, which interprets them as the sounds we hear. 


Sound wave High pressure Low pressure 


Figure 3.5 Sound waves travelling through the air from a vibrating bell 


Figure 3.6 illustrates the different waves for loud and quiet and high- and 
low-pitched sounds. 


oT 


Amplitude 


® 


Louder Quieter 


Higher pitch Lower pitch 


Figure 3.6 The bigger the wave, the louder the sound: the greater the frequency of 
the waves, the higher the pitch 
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Recording sound waves 


Analogue recording 


Analogue recording represents the continuous changes in air pressure 
caused by sound waves, as changes in voltage. The changes in voltage are 
directly proportional to the changes in air pressure and these in turn are 
stored, for example, as grooves of varying depths in vinyl records or as 
magnetised areas of a magnetic tape. 


When the disc is played the movements of another stylus through the 
groove are converted back into voltage changes. After amplification, these 
cause the diaphragm of a speaker to vibrate, reproducing the original sound. 


It is called analogue recording because the changes in voltage exactly mirror 
the changes in air pressure (or are analogous to them). The complete sound 
waves are recorded. 


Digital recording ‘Did you know? 


It was stressed in section 3.1 (on page 92) that computers do not represent A similar technique is used for 
data in an analogue way where minute changes in voltage are used to producing movies on film. If a 
represent changes in the data. Computers are digital and their transistors series of images, or frames, of 
are either on or off with nothing in between, so sound data, like any other the same scene, but with small 
form of computer data, must be represented as streams of 1s and Os. differences between them, are 


played back in rapid succession, 
our brains think that we are 
watching moving objects. 


Continuous change cannot be represented by a digital stream, so digital 
recordings take a series of sound ‘snapshots’. When these are played back 
rapidly in succession the sound produced seems to be continuous. 


These ‘snapshots’ of the sound are called samples and the process of taking (epic a amr) *; 
them is called sampling. 


| Sampling: taking measurements 
The following diagram shows an analogue sound wave being sampled. | of the sound wave at regular but 


_ distinct intervals of time (e.g. 
44,100 samples per second). 


Each line represents a new sample. The time between each line/sample represents the 
sampling period, which equals 1/44,100 of a second, for a CD with a sampling rate of 44.1 kHz. 


Figure 3.7 How a sound wave is sampled digitally 


The sampling of the continuous sound wave is carried out by an 
analogue-to-digital recorder (ADR), which converts the sound into a 


stream of 1s and Os. 
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Fidelity: the correspondence 
between the original sound 

and the recording. The greater 
the fidelity, the greater is the 
similarity between them. Very 
high accuracy is called hi-fidelity 
or just hi-fi. 

Bit depth: the number of bits 
used to encode the information 
from each sample. Increasing the 
number of bits used increases 
the amount of detail contained in 
each sample. 


Original sound wave 


Low sample 
rate 


High sample 
rate 


A digital-to-analogue converter is required to convert the binary data back 
into an analogue wave. 


The complete wave cannot be recorded and so a digital recording can 
never be as accurate or have the same accuracy, or fidelity, as an analogue 
recording, but it does have other benefits such as: 


¢ Equipment to record and process digital sound is relatively cheap and has 
allowed people to record music at home. 

¢ Itis easily edited using computer equipment. 

e |t is easily copied. 

¢ Unlike vinyl and tape, digital files do not physically deteriorate with use, 
although they may become corrupted if the storage medium is damaged. 

¢ Digital files are more portable than records and tapes and can be copied 
to any digital medium, such as a hard drive, emailed, downloaded and 
streamed. 


Fidelity 
The fidelity of the recording is influenced by two factors. 


Sample rate 
This is the number of samples taken per second — the higher the rate the 
higher the fidelity. 


Figure 3.8 shows the effect of sampling at different rates. 


A sampling rate of 44,100 per second (44.1 kHz) is used for CDs and of 
96,000 Hz (96 kHz) for Blu-ray™ audio. 


Bit depth or sampling precision 


One important detail is the dynamic range of the sound - the range of 
volumes of sound in the music. 


Wave after sampling 


e Using 8 bits allows 256 (2%) 
gradations to be measured. 
¢ 16 bits allows (2'°) 65,536, and 


¢ 24 bits allows (2°) over 16.7 
million. 


Therefore using more bits allows 
for much smaller gradations in the 
volume differences. 


For CD recordings a bit depth of 16 
bits is used. 


Figure 3.8 The effect of sampling at different rates 


114 


Digital audio file sizes 
The size of a digital audio sound file depends on the following: 


* sample rate per second 

¢ bit depth 

¢ duration of recording 

* number of channels — mono (one channel) or stereo (two channels). 


Calculating file size 


Calculate the file size of a stereo 


number of samples per second = 44,100 


bit depth = 16 bits |_| recording of three minutes’ 
duration = 2.5 minutes _ | duration with a sample rate of 
number of channels = 2 44,100 and a bit depth of 24 bits. 


The size of the file is found from the following formula: 

file size in bits = sample rate * bit depth * duration (in seconds) * 
number of channels 

Therefore the file size of the above recording is 

44,100 * 16 * 2.5 * 60 * 2 = 211,680,000 bits or 26,460,000 bytes or over 
26 megabytes. 


1 Explain what is meant by a ‘pixel’. (2 marks) 
2 The following diagram shows a black and white image consisting of 36 pixels. 


Figure 3.9 A 36 pixel image of the letter E 


a Explain why 36 bits are needed to represent the pixels in the image. (2 marks) 
b Write the bit pattern needed to represent these pixels. (4 marks) 
c State the number of bits per pixel that would be needed if the image was 16 colours rather than 2. (1 mark) 
3 Jane is using a computer program to record herself playing the guitar. 
a Describe how the program converts her music into a file. (3 marks) 
The software allows her to increase the sample rate. 
b Describe two effects this will have on her recording. (2 marks) 
c She eventually records the music in mono with a sample rate of 44.1 kHz and a bit depth of 16. If the 
recording lasts for two minutes, calculate the size of the file produced. Express your answer In 
megabytes. (2 marks) 
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1 This simply requires a one-sentence definition. 
2 a You need to explain why 1 bit will be needed to encode the colour information of each pixel in a black and 
white image. 


b You should show the bits required for each line of the image. Don't forget to state the type of bit used to 
represent each colour. 
You could represent this in powers of 2. 
3 a This requires a longer answer. You should start with describing how the sound reaches the recording device 
and then how it is converted into a digital format. 


b Two effects are needed — quality and size. 


This requires you to apply the formula to calculate file size. Read the question carefully so that you use the 
correct figures in the calculation and remember that sample rate is defined as samples per second. 


summary hb 


¢ All data is represented by bit patterns — sequences of 1s and Os. 

¢ The ASCII code represents text characters as patterns of 7 or 8 bits. 

¢ A pixelis the smallest single point of colour in a graphic image. 

¢ The amount of colour information used for each pixel is called the ‘bit depth’. 


¢ The file size of a bitmap image is calculated by multiplying the number of pixels in the width of the image by 
the number in the height and then multiplying by the number of bits used to encode each pixel. 


¢ Digital sound recordings are produced by sampling analogue sound waves. 
¢ The sample rate and the bit depth affect the fidelity of the recording and also the file size. 


Strengthen 


S1 Represent the following sentence using the denary ASCII codes for the characters: 
Data is represented as bits. 


S2 An image is described as being 640 x 480. If it has a bit depth of 24, calculate the size of the file. 
Challenge 


C1 !n your own words explain why some digital images are clearer and display more detail than others. 
C2 Explain the factors influencing the file size of a digital audio recording. 


How confident do you feel about your answers to these questions? If youre 
not sure you answered them well, try the following activities again: 


¢ For S1 try Activity 8. 
e For S2 try Activity 11. 
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3.3 Data storage and compression 


Learning outcomes me 


By the end of this section you should be able to: 
* convert between the terms bit, nibble, byte, kilobyte (KB), megabyte (MB), gigabyte (GB), terabyte (TB). 
* explain the need for data compression and methods of compressing data. 
e use the run-length encoding (RLE) algorithm to compress a data fille. 


Data storage 


All data consists of bits - 1s and Os. In section 3.2 (on page 111), you 
calculated the size of an example image file as being 293,093,376 bits or 
36,636,672 bytes, and the size of an example sound file (on page 115) to be 
211,680,000 bits or 26,460,000 bytes — a typical byte is equivalent to 8 bits. 


When converting binary to hexadecimal in section 3.1 (on page 104), you 
used a unit called a ‘nibble’, which is equal to 4 bits. 


If these files are saved on computers running different operating systems, 
their sizes in megabytes (MB) are displayed differently. For example, in 
Microsoft Windows’ the image file size would be shown as 34.9 MB, whereas 
in Ubuntu’ it would be shown as 36.6 MB. 


This is because of the method used to calculate higher units. Ubuntu uses 
SI units (International System of Units), which treats ‘kilo’ as meaning 1,000 
as it does when measuring distance and weight. This method is known as 
decimal prefix because it uses powers of 10 (e.g. 10° for 1,000). 


The other method, the binary prefix, uses powers of 2, so ‘kilo’ is defined as 
a2 of 1,024. 


The table shows the differences. 


Unit Symbol Decimal prefix Size 

kilobyte KB 10° bytes | 1,000 bytes 
megabyte MB 10° bytes | 1,000 kilobytes 
gigabyte GB 10° bytes 1,000 megabytes 
terabyte TB 10'* bytes | 1,000 gigabytes 


Therefore, every higher unit is 1,000 times larger than the previous one with 
decimal prefix, whereas in binary prefix it is 1,024 times larger. 
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_ Decimal prefix: multiplies a unit 
by powers of 10. 
Binary prefix: multiplies a unit 
by powers of 2. 


Binary prefix Size 

2’ bytes , 1,024 bytes 

2° bytes | 1,024 kilobytes 
2°° bytes | 1,024 megabytes 
2“° bytes | 1,024 gigabytes 


Although the binary prefix has been used to define file sizes for decades, 
more recent use of the decimal prefix has caused confusion. In 1998, 
binary prefix units were given new names, such as the kibibyte (kiB) and 
the mebibyte (MiB). 

To add to the confusion, different components of computer systems are 
often described using different prefix systems. For example, the size of 
the main memory or RAM (random access memory) might be referred to 
using the binary prefix, so that 1 GB means 1,073,741,824 bytes. 
However, when referring to the storage capacity of a hard disk drive, 

the decimal prefix is used. Therefore a 500 GB hard drive holds 
500,000,000,000 bytes. 


Activity13 
1 Ahard disk is described as having a storage capacity of 1.5 TB. 
What is this in: 
a_ gigabytes 


b megabytes 
c_ kilobytes 
d_ terabytes? 


2 Animage file has a size of 363,143,213 bits. What is its size in 
gigabytes? | 


Data compression 


Files have to be stored somewhere, for example on local or remote hard disk 
drives or on USB drives. If they are graphics or audio files, then these files 
can be very large. 


In March 2015 it was estimated that: 


* 2.5 quintillion — 2,500,000,000,000,000,000 — bytes of data are produced 
every day. 


* Google stores 10 exabytes (10 billion gigabytes) of data at its vast data 
centres. 


* on Facebook more than 200 million images are uploaded each day and 
on Snapchat almost 100,000 each minute. 
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Millions of image and audio files are uploaded and downloaded each day. 


There are many benefits to the files being made as small as possible. The 
users don't waste time while their data is being transferred or saved and the 


providers can store them in as little space as possible. Keyterms 


That is why file compression is so important. Compressed files use Compression: changing the 
less network bandwidth to upload and download, there is less internet | format of a data file so that the 
congestion and it makes possible the streaming of video and audio files. size of the file becomes smaller. 


Lossless compression: 
compressing a file in such a way 
that it can be decompressed 


Compression algorithms are used to make the files as small as possible. 
There are two types of compression - lossless and lossy compression. 


Lossless compression | without any loss of data. 
When lossless compresion is used, no data is lost and the original file can | Lossy compression: 
be restored. | compression where some of the 


data is removed; the original file 
cannot be restored when the 
_ lossy file is decompressed. 


It is especially useful for files where the same data occurs many times, for 
example text, where there are relatively few different items of data (ie. 
character and punctuation symbols). In a page of text there are many words 
that are used more than once. This is referred to as redundancy. 


Lossless compression is essential for text as missing data would completely 


distort the meaning of what was being communicated. 


Run-length encoding (RLE) 

Run-length encoding is used to reduce the size of a repeating string of 
items. The repeating string is called a run and is represented by two bytes 
— the first byte represents the number of occurrences and the second, the 
item of information. 


For example, the following string of letters 
cccmmmmmsssssdddcccccc 

would be represented by 

3c5m5s3d6c 


If one byte is used to encode each letter in the original uncompressed text, 
22 bytes in total are required. 


In the RLE version, one byte is used for the letter and one byte is used for 
the number, making a total of only 10 bytes. 


Obviously, this method is only effective where there are strings of the same 
data. If the original data is very different, without repeating strings, then the 
encoded version would have a greater file size. 


For a black and white image, like the one in Figure 3.10, compression using 
RLE would be effective. A colour image in which there are very short runs of 
different colours would not be encoded as effectively. 
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Figure 3.10 A bitmap diagram of the number 5 


To demonstrate run-length encoding, the letters ‘w’ and ‘b’ are used to 
represent the white and black pixels instead of 1s and Os 


Code RLE version Size of coded version 
wwbbbwww /2w3b3w 6 

wwbwwwww 2w1b5w 6 

wwbwwwww 2w1b5w 6 

wwbbbwww /2w3b3w 6 

wwbbbwww 2w3b3w 6 

wwwwbwww -4w1b3w 6 

wwwwbwww 4w1b3w 6 

wwbbbwww 2w3b3w 6 

64 bytes 48 bytes 


The file size for this character has been reduced from 64 to 48 bytes. It could 
be reduced even further if the number 1 is omitted where there is a run of 
only one. The algorithm could be easily adapted. 


Activity 14 ? 


Show the effect of applying run-length encoding to the following graphic. | 


| Figure 3.11 A bitmap graphic of the letter T 
| Set out your answer as in the example above. 
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Coding the run-length algorithm 
An RLE algorithm will have to 
traverse a string and count the 
number of occurrences of each 
character in the run. 


When the character changes it will 
have to start counting a new run. 


The flowchart shows a possible 
algorithm to do this. 


First the input string is checked to 
see if it actually has any letters (i.e. 
is the length greater than zero?) 

If yes, a check is made to see if it 
only has one letter. If so the run will 
be equal to 1 and the text character 
will be the one at index position 0 in 
the string. 


If there is more than one character 
in the input string, then a loop is 
needed to examine each one. 


If the character at index position 1 
is equal to the character at position 
0, then the length of the run is 
incremented by 1 and the character 
at index position 2 is checked. 


If this character is different, then a 
new run is started for this character 
and the previous character and its 
run length are added to the string 
holding the code. 


This continues until the algorithm 
reaches the end of the string. 


Start 


Pan eee 


: Input text 
to be encoded 


: 


text = input string 
newText = “" 
code = "” 
length = length of input text string 
index =0 


+ —— 


Figure 3.12 A flowchart of a possible RLE algorithm 
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NO 
= -—— — — 7 
NO Is 
a length = 
1? B 
i YES 
: run=1 
’ code = code + run 
$ | _ code= code + text(index) 
run=1 — 
newText = text(index) ‘ 
| - _ 
2 <: Output 
’ code 
index = index + 1 i 
y End 
C Is r : 
index <= = ne code = code + run 
length — 1? code = code + newText 
YES 
, 
D 
ri = NO code = code + run 
run=run+1 text(index) = code = code + newText 
; newText? 
E 
| 
if YES 


1 Six variables are used in the flowchart. List the variables and the data 
items they represent. 
2 Five items are labelled A to E in the flowchart. Match each item below 
with the statement that describes it (e.g. A3). 
1 This checks to see if the end of the input text has been reached. 
2 If there is only one character it and its run length are added to 
the output file. 
3. This adds the character and its run length to the output string. 
This checks to see that some text has actually been entered. 


5 This checks to see if the next character in the input string is the 
same as or different from the one that is being checked. 


Extend your knowledge hh 


Using the programming language that you are studying, create and test a 
program to carry out run-length encoding on an input string. 


Lossy compression 

Lossy compression decreases the file size by deleting some of the data. The 
original file therefore cannot be re-formed entirely when it is decompressed, 
so it cannot be used with text or program files. 


It can be used for bitmap image and audio files where we often cannot 
notice that data has been removed. 


Bitmap images 

Digital images can comprise millions of pixels each with a 24-bit depth. They 
result in very large files. However, many of the minute differences in colour 
are wasted on us. Our eyesight is not capable of distinguishing these small 
differences. 


A lossy compression algorithm analyses all of the data in the image and 
when it finds areas with minute differences it gives them the same colour 
values. It can then rewrite the file using fewer bits. 


The most commonly used compression technique was developed by 
the Joint Photographic Experts Group and produces JPEG files with the 
extension jpg. 


The images on the next page demonstrate JPEG compression. The one on 
the left has a file size of 6 MB, while the compressed version on the right is 
only 350 KB. They both have the same number of pixels, but the algorithm 
made pixels with similar colour values the same. The size has been reduced 
over 17 times, but to our eyes there is little difference in detail and clarity, 
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Audio files 

Uncompressed audio files that contain all of the sampled sound data are 
saved in Waveform Audio (WAV) format. A three-nunute recording typically 
has a file size of about 30 MB. 


Much of the data in an audio file encodes tones and frequencies that our 
ears Cannot hear and small differences in volume and frequency that they 
cannot distinguish. 


The most commonly used algorithm compresses them and encodes them 
as MPEG-1 audio layer 3 files or, as they are commonly called, MP3 files, by 
removing the redundant or superfluous data. 


A 30 MB WAV file can be compressed to a 3 MB MP3 equivalent. 


The following graphs show the differences in dynamic range between a 
WAV file and its MP3 equivalent. They clearly show the dip in range at high 
frequencies. 


dB dB 
Q 0 
12 12 
Ze 24 
6 36 
46 48 
60) 60 
72 1a 
44 84 
96 96 
108 108 
120 120 
dB —_ dB 
Hz 5000 10000 15000 20000 Hz 5000 10000 15000 20000 
MP3 WAV 


Figure 3.13 The difference in ranges between an MP3 and a WAV sound sample can 
clearly be seen 
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Digitally compressing an image makes little difference to our eyes but huge savings in file size 


For Question 1 just list them in 1 Order the following units from smallest to largest. (3 marks) 
ascending order and for Question GB bit TB byte nibble MB 

; ee cou HIS MeEaee 2 Compress the following two-colour bitmap data using 
version. 


run-length encoding. (3 marks) 
111000110000001110001110 


¢ File sizes are measured in kilobytes, megabytes, gigabytes and terabytes. 
¢ Each unit is 1,000 (10°) times larger than the previous one. 

¢ Files can be compressed using lossy and lossless compression algorithms. 
¢ In lossy compression some of the data is lost. 

¢ In lossless compression none of the data is lost. 

¢ Run-length encoding is an example of lossless compression. 


Strengthen 


$1 List the units used to measure file size in ascending order starting at ‘bit’. 
S2 Convert 2.3 terabytes into bytes. 
$3 Encode the following using run-length encoding 

aaabbaaaaaaaaacccddd 

and calculate the file size of the original and the encoded versions. 
Challenge 


C1 Explain why it is difficult to compress digital images using lossless compression algorithms. 


C2 In your own words explain why an audio file compressed using a ‘lossy’ algorithm often sounds the same as 
the original uncompressed version, 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try the following activities again: 


For S1 try Activity 13. 
For S2 try Activity 13. 
For S3 try Activity 14. 
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3.4 Encryption 


Leaming outcomes he 


By the end of this section you should be able to: 
¢ explain why data encryption is needed. 
° explain how a Caesar cipher algorithm works. 


When we send sensitive information such as bank or credit card details over 
an internet connection we check (or should check) the URL to ensure that 

it starts with ‘HTTPS:’, denoting a secure connection. Internet connections 
are insecure and messages are easily intercepted. The secure connection 
ensures that the transferred data is encrypted and is read only by the 
intended recipient. 


HTTPS stands for Hypertext Transfer Protocol (HTTP) with the Secure Other methods have been used 
Socket Layer (SSL)/Transport Layer Security (TLS) protocol added to it. It to hide messages. A Roman 
ensures that data transfer is encrypted. general sent a secret message 
by shaving off a slave's hair and 
Ever since people began to use writing, encryption has been used to make tattooing the message on his 
messages secret. In ancient Egypt, scribes slightly changed hieroglyphics so head. When his hair had grown, 
that the general public could not understand them. the slave delivered the message 


to the recipient who had been 
told how to find it. 


Modern methods of data encryption 


Asymmetric encryption 

This method encrypts and decrypts data using two different keys. Every user 
has two keys, a public key, known to everyone, and a private key, known only 
to themselves. A message encrypted with a particular public key can only be 
decrypted by the corresponding private key. 


If person A wanted to send an encrypted file to person B, person A would 
encrypt it with person B's public key. On receipt, person B would then decrypt 
the file with their private key. 


It is commonly used to send encrypted messages because anyone who wants 
to send an encrypted message can get the intended recipient's public key 
from a public directory. 


Symmetric encryption 
This method encrypts and decrypts a message using the same key. 


This is the method used by an HTTPS connection. The client contacts the server 
and they establish a secure connection. The client generates the key to be used 
using an agreed algorithm and sends it to the server using the server's public 
key (asymmetric encryption). The server decrypts it using its private key and 
then they both use the common key for their secure transmissions. 
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Plain text 
(input) 
Cipher text 
(output) 


Encrypt the following using the 
Caesar cipher with a key of +3. 


This is a message 


OP Je ir aaa 


¢ The ASCII codes are 65 to 90 
for the upper-case alphabetic 
characters and 97 to 122 for 
lower-case ones. 


¢ A key of +3 means that the 


characters in the message are 


shifted three places to the 
right. Whereas if the key is —3 
the characters are shifted to 
the left. 


¢ Have a look at Activity 8 on 


page 106 and the Top tip on 
page 107 in section 3.2. 


* Have a look at analysing a string 


on page 107 in section 3.2. 

¢ Remember that string index 
positions start at 0. 

¢ Have a look at Activity 10 on 
page 109 in section 3.2. 


The Caesar cipher 
The Caesar cipher is a very simple encryption method and would not be 
used today because it is so easy to crack. 


It is named after Julius Caesar who encrypted messages in the following way. 


The letters of the alphabet are shifted a set number of places. A positive 
shift moves the letters to the right. A negative shift moves them to the left. 


For example, with a key of +2, the alphabet would be changed to that shown 
on the bottom row. 


Tr pi clipe FlGsHl bitte) MONO Or Rs i yy ee 


J K LIMPN OP OR 9S) UV Oe oe 


With a key of of -3 the letters of the alphabet shift three places to the left, 
so the word ‘SECRET’ becomes ‘PBZOBQ’. 


Notice how the letters at the end move to the start of the alphabet. 
So if the message was 

PLEASE SEND MORE TROOPS 

the encrypted version using a key of +5 is 

UQJFXJ XJSI RTWJ YWTTUX 


If the recipient knows the key they can decrypt the message by shifting the 
letters in the opposite direction by the required number of places. 


Algorithm for the Caesar cipher 
This is a perfect example to practise your computational thinking skills. 


You are going to write a program to encrypt and decrypt a message using a 
specific key using the Caesar cipher method. 


Here is a decomposition of the problem: 


Determine whether the message has to be encrypted or decrypted. 
If the message is to be encrypted, then 

Ask the user to enter the message. 

Ask the user for the key. 


wu F&F WN - 


Analyse each character in the message and if it is a letter calculate the 
letter the ‘key’ number of places to the right if the shift is positive or to 
the left if it is negative. 


6 If it is a punctuation mark then do not change it. 


7 Build up a new message using these new letters. 
8 Output the new message. 


If the message is to be decrypted, repeat the above, but reverse the shift. 
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Here is a flowchart of a possible solution. 


All this seems straightforward, but there are some more sub-problems. ¥ 


¢ Your program will need to check that the user input ~ the key and the 


message — has actually been entered. 


¢ The algorithm will have to start at the first character and move to the 
last (i.e. it must traverse the string), therefore your program will have 


to find the length of the message. 


* Some of the letters may be entered in upper case and some in lower 
case. Your program should be able to cope with this and encode or : 


decode them. 


¢ Your program will have to leave any characters such as spaces, 


commas and full stops as they are. 


Your program will have to cope with letters near the 
start and end of the alphabet. 


For example, if the key is +3 and the letter to be | 


encoded is ‘X’, then the encrypted version will be A’ 
(because it has been shifted three places to the right). 
Similarly, if the letter to be decrypted is ‘B’, the new 
version will be ‘Y’ (because a shift of 3 places to the 
right has been performed). 

The algorithm must be able to move from the first 
letter of the alphabet to the last and vice versa. 


The ASCII codes for upper case characters run from 
65 to 90. The code for ‘X’ is 88. With a key of +3 this 
would encrypt to 91, but this doesn't represent a 
letter of the alphabet. 

It should encrypt to the letter ‘A’, which has the ASCII 
code of 65. 

Therefore 88 has to be changed to 65. 

There are 26 letters in the alphabet, so you should 
be able to see how to get from 64 to 90 using the 
number of letters in the alphabet. 


Don’t forget it’s the reverse if you are decrypting. 


Start 


Input ‘e’ for encryption 
A or ‘d’ for decryption 


Y 


Is r N 
input ‘e 
or‘d’? 


ie) 


YES s* — 
Input shift to be used + or - anda : 
number of places (between 1 and 25) 


Is key NO | 
appropriate? > 
YES [ee - 
7 
Input mesage to be 
encrypted or decrypted 
7 
' 
Message NO + 
“entered? 
NS 
~ 
YES © 
6 YES Was ‘e’ : NO + 
— >) entered? q———_—-, 
7 a 
Examine each character Examine each character 
in the message in the message 
sg ¥ 
1 Is ita Is ita > 
letter? fetter? 
' , 
+ YES YES + 
Move the letter the ‘key’ Move the letter the ‘key’ 
number of places to the number of places to the 
left or right in the alphabet | | left or right in the alphabet 
T ’ 
J ’ 
NO Is this the Is this the NO 
> last character? last character? ~~ + 
YES YES 
+ Display the encrypted/ a 
decrypted message 
Another YES 
_ Message? —— ceils 
NO ’ 
=» 225 
( End 


Figure 3.14 A flowchart showing a possible way of setting out a 


Caesar cipher 


Leg 


For example, functions are used 
to: 


* ask the user if they want to 
encrypt or decrypt. 

¢ ask the user for the key. 

¢ ask the user for the message. 

* carry out the encryption or 
decryption. 


Each of these functions will return | 
values that are used by the main 
program. 


Sie seb) BO) Il 
WHILE run = 1 DO 


SET method TO findMethod( ) 
SET key TO findKey() 


SET message TO getMessage() 
IF method == “e” THEN 


Structured programming 

Creating the algorithm and coding the program will allow you to use 
structured programming using subprograms, and especially functions, which 
are called from the main program. 


Return encrypt or decrypt { Get method 
+ 
Return key | Get key 
Return message | Get message 
Return encrypted message | Encrypted message 
Return decrypted message | Decrypt message 


Figure 3.15 An example of structured programming 


Here is the main program in pseudo-code — the subprograms are not included. 


#A WHILE loop is started. It will continue until the variable 
‘run’ does not contain the value 1. 


#This calls the function for the user to ask for encryption 
or decryption. 


#This calls the function for the user to enter the key. 
#This calls the function for the user to enter the message. 


SET newMessage TO encrypt(message, key) #Functions to encrypt or decrypt are called. 


ELSE 


SET newMessage TO decrypt(message, key) 


END IF 
SEND newMessage TO DISPLAY 


#The encrypted or decrypted message is displayed. 


SEND “Please enter x to quit or any other key to continue” TO DISPLAY 
RECEIVE response FROM (CHARACTER) KEYBOARD 


IF response = “x” then 


rash bh 9 Pe 
ELSE 
ce iarune rod 
END IF 
END WHILE 


#The user is asked if they want to encrypt or decrypt 
again. If not the variable ‘run’ is changed from 1 and 
the loop will stop. 
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Create and test, in the language you are studying, a program to encrypt 
or decrypt a message using a key entered by a user, 


1 Encrypt the following word using the Caesar cipher with a key of +6. 
Computer (3 marks) 


2 Decrypt the following word using the Caesar cipher with a key of +3. 
Pzfbkzb (3 marks) 


These questions do not require you to give an explanation or description. 
They are asking you to apply the Caesar cipher algorithm to encrypt 

and decrypt. You should transpose the characters to the left or right as 
instructed by the key. 


Summary 


¢ Encryption is needed to ensure that sensitive data is secure and is read only by the intended recipient. 

e Encryption keys are used to encrypt the data into cipher text and decrypt it back into normal (plain) text. 

e Asymmetric encryption uses two different keys and symmetric encryption uses the same one. 

« Inthe Caesar cipher algorithm the letters of the alphabet are shifted a set number of places as defined by the key. 


bh 


$1 Encrypt the sentence below using the Caesar cipher with a key of +9: 
Data should be encrypted. 
S2 Explain how a Caesar cipher algorithm works. 


ie eS ae 


C1 Use a diagram to demonstrate that a shift of +5 followed by a shift of -2 is the same as a shift of +3 when 
applied to the word ‘gold’. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try the following activity again. 


e For S1 try Activity 16. 
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3.5 Databases 


Learning outcomes 


By the end of this section you should be able to: 
© describe the characteristics of structured and unstructured data. 


° decompose, organise and manage data in a structured database. 


e describe and use the following: 
o tables 
o records 
o fields 
o relationships 
o keys. 


Database: an organised store of 


data. 


People have been recording and storing data since writing systems were 
invented thousands of years ago. 


The first written language was invented 5,000 years ago in Sumer, now in 
modern-day Iraq. 


The rulers carried out a census and made lists of all the people and what 
they owned. This stored information gave them power and allowed them 
to start taxing the people. 


In 1086 William the Conqueror did something similar with the Domesday 
Book. It listed everything that everyone owned so they could be taxed on 
their possessions. There was no argument — if the data was in the book, it 
was the law. 


These books were stores of data, but finding a particular piece of 
information in them was difficult They weren't organised in alphabetical 
order. If an official was looking for a particular person, they might have to 
read many pages before finding the information they needed. 


A database is a data store in which the data is organised in an ordered way 
so that information is quickly retrieved when the data is being searched or 
queried. 


For example, in an encyclopaedia or dictionary the data is in alphabetical 
order. Imagine searching for a word if all the entries were in a random 
order. 


Computer databases 
Computers are ideal for storing and manipulating data because they 
can search and sort the data far more quickly than a human can. 


Storing the data in a structured way makes searching it much faster and 
more efficient. 
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Creating a database 


As an example, consider the following scenario of a school that would like to 
create a database to store details of their learners. 


To set this up some abstraction is needed. 


Identifying the entity and its attributes 


The database is about an object called a learner. It will store details about it. 
Therefore LEARNER is referred to as the entity. 


The next task is to identify the attributes of the entity. 


With respect to an actual learner and their properties that are relevant to 
the school, you might identify the following attributes. 


First name, surname, date of birth, year, tutor group 

There could be many more depending on the requirements of the school. 
This is written as: 

LEARNER(first name, surname, date of birth, year, tutor group) 


The entity is in upper case followed by the attributes enclosed in brackets. 


Creating the database table 
Now that the entity and its attributes have been identified, you can set up 
the structure for storing the data. The structure is in the form of a table. 


In this example the table with some of the learners entered looks like this. 


Fields 


first name surname date of birth tutor group 


Stephen Jackson | 12/5/2004 


Matthew 


Catherine 


A record 


Figure 3.16 A database table showing examples of information about learners 


In the table, the same items of information are being stored about each 
learner. These items of information are called fields. All the information 
about a particular learner is called a record. Each row of the table represents 


one record. 


All of the records in a table make up a file. 
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Activity 18 
Identify five attributes for each of 


the following entities. Write them 
in the format shown. 


CAR, BOOK, DVD 


Entity: something that is 
recognised as being capable of an 
independent existence, is uniquely 
identified and about which data 

is stored. 


It is usually a physical object 
(e.g. a Car, a person or a book), 
but it can be a concept (e.g. a 
customer transaction where 
details of the items that were 
purchased are stored). 


Attribute: an item of data stored 
about an entity. 


Table: a collection of cells 
organised in rows and columns 
used to store data in a structured 
and organised manner. 


File: all of the records in a table or 
a group of linked tables. 


Primary key: a unique identifier 
for each record in the table. It is 
usually a field that is guaranteed 


to hold unique information for 


each record. 


UPI first name 


| 01560 , Stepen 
01356 . Catherine 
01356 Catherine 


surname 


| Jackson 
| Byrne 


Byrne 


Identifying a primary key 

Database management software sets up the tables and manages the records 
when they are stored. It needs to be able to identify each individual record, 

50 each record will need a unique identifier to distinguish it from all the other 
records in the table. 


This identifier is called the primary key. 


In the example table for the school database, none of the fields will 
necessarily have a unique value. There could be many learners with the same 
first name, surname, date of birth, year or tutor group. It would be a strange 
school if the year field had a unique value - it would mean there is only one 
learner in each year. 


An additional field is needed so each record has a unique identifier. 
Education databases solve this problem by giving each learner a unique 
pupil identification (UPI) number when they enter the school system. Here is 
the school database table with the primary key added. 


UPI , first surname date of year tutor 
cies | name — | birth group 
.01560 — Stephen | Jackson 12/5/2004 7 = R11 
01356 Catherine Byrne 13/1/2002 9 Al 

01390 Matthew — Jackson 17/11/2001 49 R2 

01630 | Catherine Smith 12/5/2004 i Rl 


Each record can now be uniquely identified through the UPI field, its 
primary key. 


This table will store the details of all the learners in the school. It can be 
searched to find specific individuals or all the learners in a particular tutor 
group, for example, and print out the list. 


Now imagine that the database is such a success that the school would like it 
to manage the lending of books from the school library. 


It will now need to store all of the learner information plus the details of the 
books they have borrowed. 


Care is needed when designing how this data will be organised and stored. 


Data redundancy 


The data could be organised in the following way, again with the UPI field 
being the primary key. 


date of birth year tutor bookname author borrow return 


group | date date 

| 12/5/2004 = 7 =| R1 | Computing J. Smith 1/10/15 | 15/10/15 _ 

13/1/2002 «9 = Als Gardening | Green 3/10/15 17/10/15 
13/1/2002 «9 ~=9A1 —~Plants F.Green 3/10/15 17/10/15 
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The database illustrates some serious flaws in the way in which the 
data is being stored. 


¢ The same data is being entered more than once - UPI, first name, 
surname, year and tutor group have to be written every time a 
book is borrowed. Also the same book information will have to 
be rewritten when it is borrowed by another person. This is time 


wasteful. Having the same data entered more than once is called 
data redundancy. 


e When the same data is entered more than once, errors can occur. In 
the second version of the database Stephen is misspelt as ‘Stepen’. 
This is known as data inconsistency. 


But there is an even greater problem — the UPI is needed to uniquely 
identify each record. It cannot appear twice in the same table. This 
means that each learner can borrow only one book. 


You could solve this problem by adding lots of new fields so that the 
learners can borrow as many books as they want. 


author 2 borrow return 


UPI first surname date year tutor Book Author_1 |borrow return | book 
name of group name_1 date_1 date_1 name 2 


birth 


Four new fields are needed for each book borrowed. 


e How many new fields should be added? 
¢« How many books will a learner borrow? 
e Will all learners borrow the same number of books? 


There may be too many fields for some learners and not enough for others. 
For many learners some will never be used. 


it is obviously very inefficient to use repeating fields in a database (e.g. book 
name_1, book name_2, book name_3). 


Not enough thought has been given to creating the structure of the 


database. This illustrates the importance of database design. 


Creating a structured database 
The problems are solved by having more than one data table and linking 
them together in a single database. 


The learner table is retained — it could be named tblLearner — and a separate 
one for the books, tblBook, created. 
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date 2 date 2 


| Foreign key: a field in one table 
that uniquely identifies records in 
another table (ie. it is a primary 

_ key in another table). 


Relational database: a database 
that allows data elements in one 
table to be related to any piece 
of data in another table as long 
as both tables contain a common 
element. 


The book table might look like this. 


book number __, title _ author 
03169 | Computing J. Smith 
04675 Gardening F. Green 
05936 Plants F. Green 


The ‘book number’ field is the primary key. A new number is issued when a 
new book is bought for the library. 


The book borrowing is now recorded by creating a new table - tb!Borrowing 
— with the following fields. 


UPI book number borrow date return date 


There are now three tables, but they are linked. The primary key fields from 
the Learner and Book tables are both in the Borrowing table. 


tblLearner 


UPI* 


first name 


surname 


date of birth 


book number 


year 


tutor group borrow date 


return date 


Figure 3.17 An example of a relational database 


The primary key fields in tblLearner and tblBook are indicated with asterisks. 


tblLearner and tblBook are linked to tblBorrowing because their key fields 
are included in it. tol|Borrowing is said to be a link table. 


When a key field from one table is included in another it is called a 
foreign key. 


By linking the tables we have structured the data to create a relational 
database. 


Whenever a learner borrows a book, their UPI, the book number and 
the borrowing and return dates are entered into a new record in the 
Borrowing table. 


This allows the librarian to create a search or query, linking the three tables, 
to find overdue books and the learners who have borrowed them. 
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If a book is overdue, all the librarian has to do is identify the learner from 
their UPI by looking in the Learner table and the book from its book number 
by looking in the Book table. They can send an overdue letter to the learner 
stating the book borrowed and when it should have been returned. They can 
also work out the fine. 


Compound keys 

A key field has not been identified for tblBorrowing. 
tblBorrowing | If the UPI is chosen as the unique identifier, then each 
UPI learner will only be allowed one record in the table. They 


could only ever borrow one book. 
book number 


If ‘book number’ is chosen, then each book will only be 
allowed to be borrowed once. That would be a waste 


This problem can be solved by having a compound key. | Compound key: a key that 
consists of two or more fields 
used to identify a record uniquely. 


borrow date 


tblBorrowing 
— + --—— In tb!Borrowing both UPI and book number could be 

UPI* combined into a compound key. So learners will be 

book number* able to borrow more books because their book number 

fields will be different and books can appear more times 

in to|Borrowing as long as they are combined with a 

return date different UPI. 


borrow date 


There is still a problem. What if a learner wants to reread a book? The database 
will not allow this as the UPI and book number fields will contain the same data 
as in the previous record. 


tblBorrowing This is solved by expanding the compound key to include 
the ‘borrow date’ field. Now a learner can borrow a book 
UPI* oe 
again as long as it is on a different day. 
book number* 
An efficient database has been created by carefully 


borrow date* structuring the data storage. 
return date 


Table relationships 

Each learner appears only once in tblLearner, but can appear many times 
in tblBorrowing. 

Each book appears only once in tb|Book, but also can appear many times 


in tblBorrowing. 
J et Extend your knowledge fi, 
There are therefore ‘one-to-many’ relationships between the Learner and 


The process of structuring 
the data to create relational 
databases is called normalisation. 


<{ thiorrowing «= Find out about first, second and 


third normal forms. 


Book tables and the Borrowing table. 
This is shown as follows. 


Figure 3.18 A one-to-many table relationship 
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Activity 19 


authors. 


ae Le? | 
BOOKS 
| Book_ID* 


|| Title 


“Publisher 


| Date Published 


L 
f 


Author_Surname 


| 


| Author_First_name 


| Author_DOB 


_ Author_Nationality 


She has created the following table: 


When she starts to enter data, she creates records as shown below. 


Catherine is creating a database. She wants to store details of all her books and also information about their 


| Book _ID | Title 


Publisher 


=i 


Date _ 
Published 


Author _ 
Surname 


Author_ 
First_name 


— 


| Author _ 
Nationality 


| Author _ 
DOB 


1 Harry Potter 
and the 


Stone 
r — a — 
Harry Potter 
a and the 
Chamber of 


! 
Secrets 


Harry Potter 
| and the 
Prisoner of 
Azkaban 


a data redundancy 


Philosopher's 


H 
ae 


Bloomsbury 


1997 


Rowling 


Bloomsbury 


_ Bloomsbury | 


| 1 Explain what is meant by: 


b data inconsistency. 


1998 


Rowling 


t 


Rowling 
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JK 


JK 


— 


| 
| 31/07/1965 | British 


31/07/1965 | British 


31/07/1968 British 


2 Restructure Catherine's database design to remove the problems listed in Question 1. 


Unstructured data 
Most of the data in existence today is not stored in structured databases. 


It is unstructured. in other words, it has no pre-defined structure. XtraSave supermarket allows 
customers to order online and 
Sources of unstructured data include things such as emails, word- have their orders delivered. 


processing documents, Microsoft® PowerPoint® presentations, images, 


; XtraSave would like to be able 
spreadsheets, uploads to social networking websites and sensors. 


to print out reports showing the 


Not surprisingly, unstructured data is growing significantly faster than following details for each order: 
structured data. The amount of unstructured data doubles roughly every * the name and postcode of the 
three months. This creates major challenges for organisations — how do customer 

they store such vast quantities of data and how do they extract useful * the date of the order 


information from it? e the items that were ordered 


Unstructured data is a rich source of valuable information that can help * the name of the delivery driver. 

organisations make better decisions and give them a competitive edge. It wants to avoid data redundancy 

However, extracting information from unstructured data is much more as customer name and postcode, 

difficult than searching a structured database. To overcome this challenge, items ordered and the delivery 

a number of powerful data mining tools have been developed. Some driver name will need to be in 

are capable of analysing text and natural language to identify a person’s more than one order and they 

likes and dislikes, feelings and priorities. Others use geospatial data from do not want to have to input 

satellites, GPS systems and sensor networks to analyse trends and make the customer, item and delivery 

predictions — areas where the population is set to grow, potential crime details more than once. 

hotspots, regions at risk from natural disasters etc. Business analytic tools Design the table structure 

are used to identify customer preferences, emerging markets, consumer required, indicating the key 

trends etc. fields, and show how the tables 
are linked. 


A sports centre uses a database to store details of their members. 
MEMBER, SPORT and TEAM are three of the tables used in the database. 
1 The data for the first three members in the MEMBERS table is shown below. 


MemberNumber Surname FirstName DOB _ SportCode _ TeamCode 
00033 Abbot | Catherine | 13/1/1980 | HCK | HCK_A 
00045 Ali Mohammed _ 10/10/1995 | FBL | FBL_C 
00069 - Anderson ; Patricia | 6/6/1994 ~ HCK (ene Xe 


a State the field that is used for the primary key in the MEMBER table. (1 mark) 
b Explain why your chosen field is most likely to be used as the primary key. (2 marks) 
2 The database also contains a SPORT and TEAM table. 
Explain why SportCode and TeamCode have been included in the MEMBER table. (2 marks) 
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1 You should explain what is meant by a primary key and why the field you 
choose is the only one that could contain unique data. 

2 You should explain how relational databases function and how tables can 
be linked using foreign keys so that data does not have to be duplicated. 


A database is an organised store of data. 


An entity is something with an independent existence that is uniquely identified and about which data can 
be stored. 


A table is a collection of rows and columns that are used to store data in a structured and organised manner. 
A field is one item of information in a database table. 


A record consists of all the fields about an individual entry in a database (e.g. all the details about one learner in 
a table). 


The primary key uniquely identifies each record in the table. It is usually a field that is guaranteed to hold 
unique information for each record. 


A relational database allows data elements in one table to be related to any piece of data in another table as 
long as both tables contain a common element. 


Massive amounts of unstructured data are produced every day. Unstructured data is information that isn’t 
organised in a pre-defined structure. Examples include tweets, photos, emails and output from sensors. 


Organisations use powerful analytic software tools to extract valuable information from unstructured data. 


Strengthen = 


S1 Describe the property needed by a field if it is selected as the primary key. 

S2 Explain why the key field from one table may be included in another table. 

$3 Explain how unstructured data differs from structured data. 

cra pares ee —_ 


ME | “ie 


C1 Explain how relational databases minimise data redundancy and data inconsistency. 


C2 Explain what is meant by the term ‘big data’ and why the effective use of big data gives organisations a 


competitive advantage. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try the following activities again: 


* For S1 have a look at the definition on page 132. 
* For S2 have a look at page 134. 
* For S3 have a look at page 137. 
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Chapter 4 


4.1 Machines and computational modelling 


Learning outcomes be 


By the end of this section you should be able to: 


¢ describe the input-process-output model. 


The input-process-output model 
How many computers do you have in your house? There are probably a lot 
more than you think. Let’s find out why. 


een," 


How many of these things do you think of as computers? 


You might be surprised to know that every device shown above contains 
an embedded computer. A computer is a machine that takes some kind of 
_ Input: to enter data into a input from its surroundings, processes the input according to given rules 
| computer. and provides some kind of output. 

Process: to change the meaning 
, or format of some data. 


, 


Output: to display or output | 
' data that has been processed (or | INPUT 


|——_—_»| PROCESS 
_ has been stored). —— OUTPUT 


Figure 4.1 A simple input-process-output diagram 
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You are probably familiar with a desktop computer: the keyboard provides 
input and the screen provides output (which depends on what you type). Ina 
similar way, the embedded computer in a washing machine has controls that 
provide input and a motor and heater as outputs; how fast they spin and how 
much heat they produce depends on the input setting. 


1) Name any other inputs and outputs for 
a adesktop computer 
b awashing machine. 


2 Sid is a central heating installer. Research the computing devices he 
might use in his job. 

3 Discuss in a group the inputs and outputs for the devices shown on 
the previous page. Think of some categories that you could use to 
group the input and output devices you come up with, for example 
pointing device or audio input device. 


The clever bit in computer science is the activity that goes on between the 
inputs and outputs — the processing. Processing means performing a series 
of actions on the inputs according to a given set of rules. You might have 
met this idea in maths already if you have studied ‘function machines’. You 
might also have processed ingredients according to a recipe to make a cake; 
or put together steps according to a choreography to make a dance; or 
musical notes in harmonic combinations to make a song. 


In Chapter 1 (page 2) an algorithm was defined as a precise method for 
solving a problem, consisting of a sequence of step-by-step instructions. 

In order for processing to produce meaningful results it has to follow an 
algorithm. So you might also think of an algorithm as the sequence of steps 
that is used to process the input to produce the output. 


INPUT OUTPUT 


3 SS .——> +2 _—_ PP 8 


Figure 4.2 A simple maths function 


If you try to follow a more 4 
complicated function machine ——> PROCESS 
than this one you will probably | ~ 


need a notepad to keep track of 
your working. In a similar way, [ 
anything beyond the simplest 

algorithm needs temporary 

storage or memory. We should STORAGE 
change our diagram of a 
computer to include this. 


i 


List three outputs and three 
_ inputs of a smartphone. 
| (6 marks) 


A touch screen is an input and 
output device so could appear 
twice in your answer. 


——| «OUTPUT 


Figure 4.3 A simple function machine with storage 
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Indicates an 
input or output. 


Indicates a process 
to be carried out. 


Figure 4.4 Flowchart symbols for 
input/output and process 


Extend your knowledge 


Make a table with the headings 
‘Input’, ‘Process’ and ‘Output’ 
and with your school subjects 
down the side. See if you 

can find an example for each 
subject starting with the ones 
in the text, like this: 


sd 


_FoodTech | ingredients | 


recipe 


For each row think about 
whether the rules or 
instructions for the processing 
are similar to or different from 
those we find in computer 
science. If they are different, in 
what ways are they different? 


Activity 3 ; 


Code the password algorithm 

in the high-level programming 
language you are learning — you 
could extend it to include a 
‘medium’ output for passwords 
that meet only two of the criteria. 


L 
| 


As your algorithms become more complicated it can be useful to show the 
steps in a flowchart. These are the flowchart symbols that you met in 
Chapter 1 (page 3). 


There are many examples in history of people trying to discover the 
processing that a system is doing by changing the inputs and seeing 
what outputs are produced. For example, Gregor Mendel discovered ideas 
about genetics by breeding different varieties of pea plants and seeing 
what the characteristics of the new plants were; scientists at the Large 
Hadron Collider are smashing particles together in different ways, piecing 
together the laws of nature by analysing what comes out. 


1 Work with a partner to write a program to implement a function 
machine. Decide the mathematical steps required, then ask your 
partner to write the program. You should check your partner's 
program produces the right answers with some example inputs. Do 
the activity again, but swap roles. 


2 The sentence below is a written description of an algorithm that ; 
checks the strength of a password. Identify the inputs and outputs for 
this program and describe the processing that takes place. 


Ask the user to enter their password. If the password is eight or more 
characters long and contains at least one upper case letter and at least 
one symbol then display the word ‘Strong’, otherwise display the word 
‘Weak’. 


Summary 


* Although computers come in all sorts of shapes and sizes they all do 
principally the same thing. They take in inputs, process them according to a 
given set of rules, and generate outputs. 


* Algorithms expressed in program code specify the processes a computer 
carries out.: 


Strengthen a 
——— 


$1 Describe what is meant by the term ‘input-process-output model’, 


S2 Identify the inputs, outputs and processing involved in uploading an 
image to a social networking profile. 


Challenge —_ 


C1 Write a computer program that prompts the user to type in three 
numbers, adds them up and then displays the answer on the screen. 
Explain your program using the input-process-output model, 
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4.2 Hardware 


By the end of this section you should be able to: 


* describe the function of the hardware components of a computer system (CPU, main memory, secondary storage) 
and how they work together. 


¢ describe the function of different types of main memory (RAM, ROM, cache). 


° describe the concept of a stored program and the role of components of the CPU — control unit (CU), arithmetic/logic 
unit (ALU), registers, clock, address bus, data bus — in the fetch-decode-execute cycle (the von Neumann model). 


* explain how data is stored on physical devices (magnetic, optical, solid state). 
* describe the concept of storing data in the ‘cloud’ and other contemporary secondary storage. 
° describe the need for embedded systems and their functions. 


Stored programs: the von Neumann model Toptp | 


You probably have a calculator app on your smartphone, but your calculator 
almost certainly can't play Candy Crush Sorgen eee 
sorts of shapes and sizes, but 


Your calculator is designed to do just one job —- sums. Your smartphone on | it’s worth remembering that 
the other hand is a general-purpose computer — it can be programmed to the four functions they all have 
do a number of different jobs, from sums to web browsing to games. | in common are input, output, | 


storage and processing. 
The first computers, like your calculator, had to be redesigned or at least | 


rewired to do a different job. The idea of a general-purpose computer 
developed in the 1930s and 1940s through the work of Alan Turing and John 
von Neumann who proposed that the instructions for the processing could 
be held in storage with the input data. 


One of the first electrical, rather than mechanical, computers was ENIAC. 
Like many technological developments its original purpose was military, 
as it was used to determine the flightpaths of artillery shells. It could 

be, and was, put to other uses, but this required changing its internal 
wiring along with a large number of switches. This reprogramming was 
considered to be secretarial work and, this being the 1940s, the job was 
assigned to six women. They are now considered to be amongst the first 
computer programmers. 


Produce a leaflet describing the pioneering work of the ENIAC 
programmers. Add the details of any modern, influential female computer 
scientists to your leaflet. 
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von Neumann architecture: 
computer system design in which 
the program is stored in memory | 
with the data. 


Central processing unit (CPU): 
| hardware device that carries out 
the processing in a computer. 


_ Main memory/random-access 
memory (RAM): a temporary 
store for data and instructions 
(programs). 
Bus: a group of connections 
between devices in a computer. 


Fetch-decode-execute cycle: 

sequence of steps carried out 

repeatedly by a CPU. 

Writing: when the CPU sends data 
, to memory to be stored at a given 

address. 


Reading: when the CPU retrieves 
the data stored at a given address. 


| 


Memory address: a number that 
_ uniquely identifies a (memory) 
storage location. 


Activity 5 
Look online for an animation of 
the fetch-decode-execute cycle. 


Try to describe the animation ina 
step-by-step way. 


Activity 6 ? 


Ld 
Use a graphical programming 
language such as Scratch to 
produce your own similar 
animation. 


A computer in which the processing instructions are stored in memory 
with the data is called a stored-program computer or a von Neumann 
architecture computer. 


You will remember from Chapter 3 that all types of data (including pictures 
and sounds) are stored in memory as binary numbers. In the same way, the 
processing instructions are written as binary numbers. You will learn more 
about this in section 4.5 on page 167. 


In avon Neumann architecture computer the hardware device that does the 
processing is called the central processing unit (CPU) and the storage is 
called main memory/random-access memory (RAM). They are connected 
to each other, and to the input and output (I/O) devices, by a group of 
connecting wires called a bus. 


Bus 


Figure 4.5 Buses used in a computer 


When a computer is turned on, the CPU fetches an instruction from the 
memory, carries out the instruction it receives, and then fetches the next 
and so on. This sequence is called the fetch-decode-execute cycle and it 
carries on until the power is turned off. 


Hardware components of a computer system 


RAM and ROM 

If you have ever bought a desktop or laptop computer you will have seen 
the main memory referred to as random-access memory (RAM). This is the 
temporary store that the CPU uses for data and instructions (programs). 


Remember that both data and instructions are just binary numbers, usually 


in bytes (8 bits). RAM is typically measured in gigabytes (GB). See section 3.3 
(page 117). 


When the CPU saves data into memory this is called writing: the CPU 
uses the bus to tell the memory what data to save and where in memory 
to save it. The reverse process is called reading and the CPU must specify 
which part of memory to read from. Each memory location has a unique 
memory address, in the same way that your home is identified by an 
address. You can think of memory like a series of letter boxes each with 
space for one byte of data. The memory addresses are just numbers, 
starting from zero, labelling each mailbox. 
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Writing 
Address 3 11101000 
¢—————— 
Address 2 00000000 
’ CPU 
Address 1 10010111 | ——--_—»> 
Address 0 01101001 Reading 


Figure 4.6 Memory addressing 


RAM is described as volatile which means that its contents are lost when ‘Keyterms 
the power is turned off. Because of this, computers also need non-volatile 


memory to store any programs that must run when the computer is first | Volatile: memory that is erased 
turned on - this memory is called read-only memory (ROM). Programs when the power is turned off. 
permanently stored in ROM are known as firmware. Typically these programs | Non-volatile: memory that is not 


are small and carry out specific tasks, such as initialising the hardware lost when the power is turned off. 
components and starting the operating system when a computer is switched 
on. You might have heard of these programs being called the BIOS (short for 
Basic Input/Output System) or UEFI (short for Unified Extensible Firmware 
Interface). Nowadays, UEFI has replaced BIOS as the standard firmware used 
in desktop computers. 


| Read-only memory (ROM): 
memory that cannot be altered 

and is not lost when the power is 
turned off. 


The photograph below shows the main circuit board of a 1980s computer (the 
Sinclair ZX81). The ZX81 had either one or two RAM chips. Two are shown in 
the diagram along with three other chips: CPU, ROM and a fourth chip that 
connects the input and output devices. You can also see groups of parallel 
silver ‘tracks’; these are the buses (the wires connecting the chips together). 


The photograph on the right shows a modern-day computer (the Raspberry 
Pi). The chip in the centre contains both the CPU and RAM as well as 
handling some of the |/O, such as the screen. The role of the ROM is replaced 
by a plug-in memory card (not shown). The smaller chip to the right handles 
the slower I/O devices such as USB (Universal Serial Bus). Buses can still be 


seen although a bit less clearly. 


Computer circuit boards: Left — Sinclair 2X81; Right — Raspberry Pi 
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- Cache: memory used to 

; make up for the difference in 

_ speed between two internal 
components. 


| Activity 7 ? 


Research the two chip 
manufacturers Intel and 

ARM. Compare how their 
microprocessors are designed 
and where they are used. 


Activity 8 


Produce a presentation to explain 
what is meant by print spooling 
and deliver your talk to the class. 


Cache memory 

Most computers require a third kind of main memory: this is called cache 
memory (pronounced like ‘cash’). Cache memory is a small amount of fast, 
expensive memory that is used in-between two devices that communicate 
at different speeds, most often the CPU and RAM. 


Figure 4.7 Diagram of cache memory 


In a CPU cache, frequently used code or data is loaded in chunks from the 
slower RAM into the cache. The CPU accesses the cache memory at its own, 
faster speed. This means the CPU isn’t slowed down by having to wait for 
data from RAM. 


Extend your knowledge 


In computers up to around the 1980s CPUs and RAM worked at the same 
speed as one another, but CPU technology developed much faster than 
RAM. This meant that the RAM in newer computers could no longer 

keep up with the demands of the CPU, causing a bottleneck that limited 
the performance of those computers. To solve the problem, engineers 
introduced cache memory, but this created a new challenge. If the data 
the CPU needs is not already in the cache (called a cache miss), then the 
CPU has to wait for the cache to reload with the correct data. A lot of 
research has gone into ways of predicting which chunk of RAM to load 
into the cache at any time to reduce cache misses. 


Extend your knowledge > 


Most I/O devices have memory between themselves and the main system 
because they communicate at different speeds; this is often called a 
buffer (e.g. disk buffer, print buffer or frame buffer (for graphics)). Data is 
transferred at high speed between a processor and this memory. The CPU 
can then continue with other tasks while the I/O device takes data from 
its buffer at a slower rate. 
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What is cache memory used for? 
Describe the difference between RAM and ROM. Suggest a use for ROM. 


3 Make a cartoon strip or storyboard showing the fetch-decode-execute 
cycle. 


4 Look back at the photographs and descriptions of the ZX81 and 
Raspberry Pi. Working with a partner, identify the similarities and 
differences between them. After 35 years, what has changed and what 
is the same? 


Fetch-decode-execute: in detail 


There are many different kinds of CPU, but they share some features in 
common. To understand the fetch-decode-execute cycle in more detail you 
need to know a bit more about what goes on inside the CPU. 


The part of the CPU that does calculations and logic operations is called the 
arithmetic/logic unit (ALU). 


Inside the CPU are a number of memory locations called registers. These 
are extremely fast to access but there are usually only a relatively small 
number of them. Some registers play a specific role in the fetch-decode- 
execute cycle, for example holding a memory address, an instruction ora 
piece of data, while others are general purpose. Some of the registers with a 
specific role that are found in most computers include the following. 


- Accumulator: holds the results of calculations performed by the ALU. 
All input and output from the CPU passes through the accumulator. 


Program counter: holds the memory address of the next instruction 
to be fetched. 


Current instruction: holds the instruction currently being executed. 


The steps in the cycle are controlled by a control unit and synchronised by 
an electronic clock. You have probably seen the advertised clock speeds of 
CPUs; for example, a 2.2 GHz CPU means that its clock ‘ticks’ 2200 million 
times per second. 


we ee FE: 
OFF (0) 
Figure 4.8 Clock signal 
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Arithmetic/logic unit (ALU): the 
part of the CPU that performs 
calculations and logic operations. 


' Register: a storage location 
inside the CPU used to hold an 
instruction, an address or other 
single item of data. 


Control unit: the part of the CPU 
that organises the actions of the 
other parts of the CPU. 


Clock: an electronic device 
inside a CPU that ‘ticks’ at 
regular intervals and is used to 
synchronise the actions of the 
other parts of the CPU. 


| 


} 


' Bus width: the number of | 
wires that make up a bus - this 
determines the range of binary 
numbers that can be transmitted. » 


The bus that connects the CPU to other devices in the computer is split into 
three parts: the address bus, the data bus and the control bus. The address 
bus carries memory addresses between the CPU and memory to identify 
uniquely a memory location; the data bus carries the value to be read from or 
written to memory; and the control bus carries the signals that, for example, 
determine whether to read or write the data, and when to do so. 


re 


— = Ps 
! 
{ 


{ Control 
= 


Figure 4.9 Inside a CPU 


The number of connections on a bus is called the bus width. Since each 
connection represents a binary digit, a 1 or a 0, a greater bus width means larger 
number values can be communicated. For example, an eight-bit address bus can 
send values from 00000000 (0) to 11111111 (255), only 256 possible addresses. 
By contrast, a 32-bit computer can address up to 4 GB of RAM. 


Now that you know a little more about the parts of a CPU you can understand 
in more detail what happens at each step of the fetch-decode-execute cycle. 


Fetch 
The CPU control unit places the memory address of the next instruction 
on the address bus. It also sends a signal on the control bus requesting 
to read from memory. 
The memory receives the signal and looks up that memory location. The 
data in memory is copied on to the data bus. The CPU copies this into a 
special register. 
Decode 
| The control unit analyses the contents of the register and sends signals 
| to the other parts of the CPU telling them what to do (e.g. add numbers 
store data back into memory). 
Execute 
The instruction is completed by the CPU. 
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Extend your knowledge bh 


You are probably wondering how you get computer games just from copying, 
adding and storing binary numbers. First, remember that the I/O devices 
are also connected to the bus and they have their own addresses; storing 
numbers in their addresses can make pixels appear on a screen or sounds 
come from a speaker. Second, remember that modern CPUs can do billions 
of these simple instructions per second. What appears to be very complex 
behaviour is actually just a very large number of very simple operations. 
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, 1 In groups of three, stage a debate on the best way to improve CPU 
performance. One person should research and speak on behalf of 
clock speed, another cache size and the third on type and size of RAM. 
2 Work with a partner to produce definitions cards for the keywords in 
this chapter. 
Use your cards to play a describing game. See if you can describe the 


keyword on your card to your partner without using the words in 
the definition. 


Secondary storage 


In most situations you need to be able to store your data and programs 
after the power is turned off. You have learned that RAM is volatile so most 
computers need to be able to copy the contents of their RAM to another 
kind of storage that is not volatile - a type that doesn't lose its contents 
when there is no power. This more permanent storage is called secondary 
storage. Secondary storage is non-volatile and, coiiipared to RAM, is slower to 
access, cheaper and has much higher storage capacity. Typical capacities are 
now in the terabytes, i.e. millions of megabytes. 


You remember that all data is stored as binary: 1s and Os. There are three 
physical ways of recording 1s and Os that do not require power. 


Magnetic storage uses the fact that magnets have north and south poles. By 
magnetising something, the north and south poles can represent the 1s and Os 
of your data. This is used in hard disks and magnetic tape storage. 


Optical storage is used by CDs and DVDs. Shinier or more reflective parts of the 
disk represent the 1s or Os. 


Solid-state storage or ‘flash’ memory (such as USB memory sticks or SD cards) 
represents the 1s and Os by little pools of trapped electrons on a microchip. 


Magnetic secondary storage - hard disks 
Inside a hard disk drive is a stack of disks called platters with a magnetic coating 
on each surface. Tiny magnetic recording heads on the end of an arm float 


a millionth of a centimetre above the disk spinning at 110 km/h underneath. 


Data is recorded on each disk along circular tracks, each split into smaller 
parts called sectors. 

When data is read: 

1 Thearm moves across to be above the right track. 

2  Therequired sector comes around under the head. 

3 The magnetised surface induces a tiny current in the head. 

4 Thedisk controller translates this into 1s and Os. 


Because of these steps, data does not come from the disk immediately. Each 
step takes some time. The time for step 1 is called the seek time; for step 2 it is 
called the latency. 
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- Secondary storage: any kind of 
permanent storage to which the 
contents of ROM/RAM are copied 
(usually a hard disk, optical or 
solid-state device). 


Magnetic storage: secondary 
storage that works by 
magnetising parts of a substance 
as north and south poles to 
represent binary 1s and Os. 


_ Optical storage: secondary 
storage that works using 
differences in light reflection 
from a material. 


Solid-state storage: secondary 
storage that works by storing 
charge (electrons). 


Protective material 
Reflective layer 
Recording material 
Data bits 


Control groove (spiral) 
Transparent bottom 


Figure 4.10 The structure of a DVD 


Optical secondary storage - DVDs 

ACD, DVD or Blu-Ray™ disc is made of several layers as shown in Figure 4.10. 
Data is written along a single track that spirals out from the centre of the disc. 
The player uses two lasers: one to write data on to the disc and the other to 
read data. 

When writing, one laser heats the recording material, which creates non- 
reflective depressions on the surface of the disc, called pits. This is slow 
because the heating and cooling of the surface takes a little time for each bit 
of data. 

If a CD is manufactured ready written, it has pits in the recording layer that are 
less reflective than the flat parts, called lands. 

When reading, the laser reflects differently off the surface and this is detected 
by a light sensor. 


The spiral track is longer on the outside so when reading data from the outside 
edge the disc has to reduce its rotation speed so that the data passes under 
the laser at a constant speed. 


When data is read: 
1 The disc spins in the drive to ensure all data can be read. 


2 The tracking mechanism moves the laser into the correct position over 
the disc. 


3 The laser shines on to the disc and is reflected back on to a light sensor. 
4 Signals from the sensor are translated into 1s and Os. 


Electrical secondary storage - flash drives 


Solid-state storage uses chips, called NAND flash, comprised of special kinds of 
transistors that can trap electrons in a ‘pool’. Electrons in a pool represent the 
Os of the data, while empty pools represent 1s. 


Because billions of transistors will fit on a chip, solid-state devices are small 
and have high capacity, often several gigabytes, as you will know from the 
memory sticks you carry around in your pocket. They also give much faster 
access to read data than hard disks. 


When data is read from the chip: 

1 Control signals identify which bit is to be read out and apply a small voltage. 
If the electron pool is empty the transistor turns on and a 1 is read out. 

If the electron pool is full the transistor doesn’t turn on and a 0 is read out. 
The control signals are changed to read other bits. 


When data is written to the chip, control signals identify which bit is to be 
written and apply a higher voltage. 


6 This pulls electrons into the pools of those transistors, recording the 1s 
and Os. 
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Erasing data also requires higher voltages to remove electrons from the pools. 
Because of this, both erasing and writing cause the transistor to break down 


slowly; so flash drives can only be rewritten perhaps 1 million times before 
eventually failing. 
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Silicon chip 


Figure 4.11 Flash drive 


Both magnetic hard disks and solid-state flash memory are developing 
rapidly and will continue to increase in capacity and decrease in size for 
several years. The largest computer companies are currently researching 
some brand new technologies that might become important in the 
future. For example, HP is experimenting with a new type of electrical 
component called a memristor to create super-fast, high capacity, non- 
volatile memory chips. IBM is developing race track memory - a type 

of magnetic storage that stores information as magnetic patterns on 
tiny wires, providing much faster data access speeds than current hard 
drives or flash disk technologies are capable of. You might also have 
heard of quantum computing. Quantum computers store data as qubits. 
Qubits can be individual atoms and each one is capable of storing (and 
processing) more than one bit at once. 


Activity 11 fy 
1 What is the purpose of secondary storage? 


2 Name one kind of magnetic storage and one kind of solid-state 
storage. 


3 Monty runs his own gardening business. Explain which kind of 
secondary storage you would advise Monty to use for: 
a storing his garden design software 
b taking garden designs to show potential customers 
c  backing-up all his financial records for seven years. : 


4 Produce a presentation about future storage: you can start your 
research by searching online for the keywords in the text (e.g. 
memristor, qubit). 

5 If you have an old hard disk you no longer need, try taking it apart 

| to look at some of the components you have learned about. Your 
tutor might have one to show you; you could also ask your school’s IT 
technician. 


Cloud storage 

Sometimes it can be useful to have your secondary storage in a different 
place from your computer. For example, you might want to have a back-up 
copy of your data in a secure location or share your data with users of other 
computers; or you might want to be able to access your data when you are 
not sitting at your usual computer. 


In Chapter 5 you will learn more about how networks can be used to share 
computing resources. Computers that share their secondary storage across a 
network are called file servers. You are probably used to using the secondary 
storage on a server on your school network (your home drive or a learner 
shared area). Some hard disk drives are able to connect directly to a network 
(called NAS or network-attached storage). Others connect to several servers 
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Cloud storage: secondary storage, | 


often belonging to a third party, 
that is accessed via a network, 
usually the internet, and so is not 
in the same physical place as the 
machine’s RAM/ROM. Files stored 
‘in the cloud’ can be accessed 
from anywhere via an internet 
connection. 


Virtualisation: any process that 
hides the true physical nature of 
a computing resource, making it 
look different, usually to simplify 
the way it is accessed. 


SAN 


NAS 


Figure 4.12 Cloud storage diagram 


using their own storage area network or SAN. This type of arrangement is 
useful when it’s important not to lose access to your data, for example ina 
banking system. If one server or hard disk fails, another can take over its job 
without users of the system even noticing. 


Sometimes storage is accessed via the internet using internet services such as 
Dropbox or Google® Drive. In this situation it is usually called cloud storage. 
This is an example of virtualisation: you seem to have access to a single, 
virtual hard drive when in fact your storage might be spread across several 
servers around the world, with software dealing with the details of where each 
part of each file is actually stored. 


Storing data on servers accessed via the internet has the advantages that 


* you can access the data from anywhere on many devices using a web 
browser. 


¢ the data is securely backed up by the company providing the storage 
service. 


¢ you don't need to transfer your data if you get a new computer. 


Some people will be concerned about security: you might, for example, have 
read about high-profile cases of private photographs being accessed and 
released on the internet. Other people might be uncomfortable with not 
being able to have their data ‘in their hand’ and worried about losing access 
if they lost their internet connection. See section 5.2, page 193, for more on 
cloud security. 


server 


Cloud 


Internet 


Your computer 
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Embedded systems 


An embedded computer system is one that is designed to do a specific 
job, usually as part of a bigger system that might have other electrical or 
mechanical parts, such as a washing machine. This makes them distinct 
from general-purpose computers like the PC or laptop you use every day. 
Embedded systems are all around us in every part of daily life, for example 
in your cable or satellite TV box, your car, games console, traffic lights, 


toaster or toys. They might be so simple that you don’t even think of them 
as computers. 


Activity 12 =e 


Find out about the storage 
systems used on your school 
network. Research, for example, 
what ‘drives’ are available in My 
Computer. Does your school use . 
a SAN or NAS? Where are the 
servers located? How does this 
compare to using a service such 


Embedded computers are often cheap, low-power devices using simple as Google Drive? 

technology and limited memory. They might run just one program, which is Write a short report giving the 
very rarely updated, and so is stored permanently in firmware (ROM or flash advantages and disadvantages of 
memory). They often use sensors of some kind. different systems. 


1 Modern cars contain an engine management system. This is an 
example of an embedded computer system. 
a Suggest two inputs to and two outputs from this system. 
b_ Explain one function of the engine management system. 
2 Make a table listing five other embedded systems you might 
encounter in your daily life. You could look back to page 140 at the 
beginning of this chapter to get some initial ideas. 


In recent years wireless network technology, the internet and embedded 

systems have come together (converged) so that many devices can now 

communicate and be accessed and controlled remotely via the internet. This 

has become known as the ‘internet of things’. For example: 

¢ your printer in school might automatically order new toner when it is 
getting low; 

* your electricity meter might automatically send readings to the supplier 
via the internet; 


° you might be able to control your central heating from your smartphone. 


Imagine a future where your fridge knows when you're running out of milk 
and sends you a text telling you to pick up some more on the way home. 
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1 State what is meant by the 1 Write acomputer program that simulates a traffic light control system. 
term volatile memory. | It must cycle through the normal sequence of lights, printing out the 
(1 mark) | current state. Your tutor might give you the opportunity to use your 

2 Explain why having cache | program to control a real light output. 
eee?) ere ie | 2 Drawa flowchart showing the processing required by a chip-and-PIN 
Bavoryities guinet rc) | credit card reader. This is used when you pay for something in a shop: 
cemanks) | it collects data about the purchases from the till, gets your card details, 

3 List the steps in the fetch part checks your PIN, sends encrypted data to the bank and waits for a reply; it 
of the fetch-decode-execute might print a receipt if the transaction is allowed and informs the till 
cycle. (4 marks) | of the result. 


In Question 2, remember that * von Neumann computers process data using a stored program. 
‘explain’ means you have to give « ACPU runs a program by accessing main memory using the fetch-decode- 
a reason as well as your answer. execute cycle. 


e Main memory can be volatile RAM or non-volatile ROM. 


¢ Cache memory is used between devices that communicate at 
different speeds. 


¢ CPU, memory and I/O devices are connected using a bus. 

¢ Inside the CPU, registers provide storage and the ALU performs calculations. 
* Secondary storage can be magnetic, optical or solid state. 

¢ Secondary storage can be located away from your computer in the cloud. 


* Embedded systems are cheap, low-power computers that are dedicated to 
a specific task. 
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$1 Describe in detail the fetch-decode-execute cycle. Include the roles of the memory, buses and special registers. 
S2 Describe the advantages and disadvantages of different kinds of secondary storage. 
S3_ Explain what is meant by the term internet of things. 


C1 Describe the operation of a von Neumann architecture computer. 
C2 Compare the features of CPU registers, cache memory and RAM. 


C3 Explain the purpose of secondary storage and justify an example of a situation where Optical storage is 
the best choice. 


C4 Evaluate the use of embedded systems to produce ‘driverless cars’. 


How confident are you in your answers to these questions? If you are not sure that you 
answered them well, go back and review the fetch-decode-execute section on pages 147-148. 
Try to explain this to a family member at home: this will help you to understand it yourself. 
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4.3 Logic 


By the end of this section you should be able to: 


* construct truth tables for a given logic statement (AND, OR, NOT). 
* produce logic statements for a given problem. 


Truth tables 


You learned in Chapter 1 (page 9) about selection - determining which 
parts of a program run depending on certain conditions — usually 
expressed using IF statements. 


IF health <= 0 THEN 
SET game _ over TO true 
END IF 


In general, this kind of statement has the form: 


IF CONDITION THEN COMMANDS 


When this code is translated the logic statement or condition is calculated. If 
the result is TRUE then the commands will run. 


Here is a more complicated condition. 


IF health <= 0 AND lives = 0 THEN 
SET game over TO true 
END IF 


Truth table: a table showing 
all possible combinations of 
the inputs and outputs of an 


You can probably work out from this code when the game is over because 
you will have played games and understand what ‘health’ and ‘lives’ 
mean. But how does a computer do this? A computer needs to be able to 
calculate AND to work out if the condition is TRUE or FALSE. operator. | 


In the example the game is only over if both parts of the condition are “health <=0  lives=0 Game over? 
true. We can show all the possibilities for this condition in a table. 
No No wee 

This kind of table is called a truth table. It is the truth table for the AND — i 

: ; an No Yes _ No 
operator. The outcome is only true if both of the conditions are true. —" = | 
(Usually you will see a ‘Yes’ written as a ‘1’ and a ‘No’ written as a ‘0’) be =? No | 

s | Yes | Mes 


Ye 
Perhaps the game doesn’t have lives and we only want toendthe game | 


if you run out of ‘health’ or you pass the winning score of 1 million. a 
| TF health <= 0 OR score > 1000000 THEN 


SET game_over TO true 0 0 0 
END IF 
: | 9 1 
Here are the possibilities — this time it is the truth table for the OR " . ; 
operator. ; 
In this case the outcome is true if either of the conditions is true. L 
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Boolean: something that can 
take only the values True or 


| False; named after English 


mathematician George Boole. 
Logic circuit: an electronic circuit 
that has inputs and outputs 

that follow one of the Boolean 
operators. 


There is one more logical operator (or 
Boolean operator) called NOT. This 
operator just swaps 1s and Os. Notice that 
unlike AND and OR, NOT has only one input. | 1 | 0 


0 bal 


We could use this in our ‘game over’ example as follows. 


IF (NOT god mode AND health <= 0) OR score > 1000000 THEN 


SET game over TO true 
END IF 


We can list all the possible combinations in a truth table as below. (Like in 
maths, brackets are evaluated first.) 


— 
<=) 


0 i | acca 
0 


| 


- 
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The three logic operations AND, OR and NOT are very easy to build as electronic 


circuits known as logic circuits. 


1 Write, in words, the condition for the game to be over in the final 
example in the table. (You will see why Boolean logic is clearer and 
easier to use than English when the situation gets complicated,) 


State which part of the CPU works out the logic for conditions. 
Write a computer program that asks for your year group, grade and 


not. Start by assuming revision classes are only for Year 11 students 


invitations to revision class. 


4 Write the truth table for the statement below. This time there are 
three parts to the condition. Remember: as in maths, brackets are 
worked out first. 


IF year = 11 AND (grade < target OR target > 7) THEN 


SET revision class TO true 
END IF 


target and then prints out whether you are invited to revision class or 


whose grade is below target, then you can invent your own criteria for 
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Writing logic statements 


When you write programs you will often have to work out the correct logic 
statements for a given situation so that the right code runs. Mistakes in the 
logic (logic errors) are a frequent cause of bugs so it is important to check both 
that the logic is right and that the code matches the logic you have worked out. 


mene has decided to expand his gardening business to give online advice. He is writing a web page that 
advises customers which fertiliser to buy from his online shop. The web page will ask customers about their 
plants and then give relevant advice using these rules: 


¢ If plants have yellow leaves mainly near the soil then they need a nitrogen fertiliser. 
¢ If plants have brown leaves that are small then they need a phosphorous fertiliser. 


To program this, you first need to work out what are the important features in the rules — these will be your 
variables. In this case, the colour of the leaves is important. We can write the rules in pseudo-code like this. 


SET advice TO ‘nothing needed’ 
IF colour = ‘yellow’ THEN 
SET advice TO ‘nitrogen’ 
ELSE 
IF colour = ‘brown’ THEN 
SET advice TO ‘phosphorous’ 
END IF 
END IF 


Notice that we started off by assuming the plants were growing normally and only changed the advice if certain 
conditions were met. 


Monty points out that if leaves go brown, but are normal sized, then the customer needs potassium fertiliser - 
now the size of the leaf is also important. 


SET advice TO ‘nothing needed’ 
IF colour = ‘yellow’ THEN 
SET advice TO ‘nitrogen’ 


ELSE 
IF colour = ‘brown’ AND size = ‘small’ THEN 
SET advice TO ‘phosphorous’ 
BLUSE 
IF colour = ‘brown’ AND size = ‘normal’ THEN 
SET advice TO ‘potassium’ 
END IF 
END) ie 
BND LE 
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‘Activity 17 : > 


1 Explain why this code would 
not work if the last two lines 
were swapped around. 


2 Howcan you change the 
conditions so that it works with 
the code either way round? 


Note that we have used brackets 
in this example. You have 
learned about BIDMAS in your 
maths lessons and in Chapter 1 
(page 25). It tells you the order 
to do the sums. It is the same 
with logic operators: brackets 

go first, then NOT, then AND 
and finally OR. This is called 
operator precedence. 


Operator precedence: the order 
in which you apply the operators 
(including logical operators) ina 
mathematical equation. 


Activity 16 
1 Add the following rules to Monty's website logic on page 157. 


° If plants have cracked or misshapen leaves then they need a 
calcium fertiliser. 


° If plants have leaves that are yellow mainly on the tips then they 
need a magnesium fertiliser. 


2 Why is Monty's advice an example of abstraction? 


Lisa has built her own programmable alarm clock. She wants to program 


the alarm to wake her up at the correct time depending on the day. She 


gets up 
¢ at 7.30 a.m. if it is a school day. 


¢ at 9.00 a.m. if it is the weekend or a school holiday, except on Saturdays 


in term time when she gets up at 8.00 a.m. to play hockey. 
Let’s work out the important features of these rules. The alarm times 


depend on whether it’s term time or holidays and also on the day of the 


week; so term time and day will be the variables for your conditions. 


SET alarm TO “7230 am’ 
IF term time = false OR (day = ‘Saturday’ OR day = 
‘Sunday’) THEN 
SET alarm TO ‘9:00 am’ 
ELSE 
IF term time = 
SET alarm TO 
END IF 
END IF 


true AND day = ‘Saturday’ THEN 
‘3 O0" am’ 


Activity 18 


1 Work out what happens for the following statement. 


IF gender = ‘female’ AND (subject = ‘computer science’ 
OR subject = ‘physics’) THEN 

SEND “Superstar!” TO DISPLAY 
END IF 


a For a girl studying French. 
b For a girl studying computer science. 
c Fora boy studying physics. 


2 Which answer would be different if the programmer had forgotten the 
brackets? 


8 
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1 Edward is developing a logic circuit for an automatic door system. The operator precedence for 
* The output from the sensor on the street (5) is 1 when someone NOT, AND and OR are likely to 
approaches the door from outside. | be tested in the exam question, 
* The output from the sensor in the building (B) is 1 when someone so you need to know it. 
approaches the door from inside. | This means you should always 
* The opening mechanism works only if the manual switch (M) is off (0). | Check carefully whether you 
Write a suitable IF statement in pseudo-code to represent this logic. | need brackets. 


(2 marks) 
2 Write pseudo-code representing the logic for a cinema ticket pricing 
application. (3 marks) 
e Standard pricing is £5. 
¢ Children under 4 are free. 
¢ Children under 16 and people over 65 are half price. 


e AND, OR and NOT are called logical or Boolean operators. 


¢ They are used in selection statements such as IF. 
e Atruth table shows all possible combinations of inputs and the output produced. 
¢ The order of precedence is: brackets, NOT, AND, OR. 


$1 Construct a truth table for the AND operator. 


$2 Write a logic statement for the operation of headlamps on a car - the headlamps come on if the light sensor 
reading is dark or if the headlamp switch is turned on. 


$3 Produce a revision summary of this chapter. 


C1 Construct a truth table for the expression P = (A AND B) ORC. 
C2 What combination of logic operations would produce an output of 1 only if the two inputs were different? 


How confident are you in your answers to these questions? If you are not 


sure that you answered them well, go back and review the examples on 
pages 157-158. 
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4.4 Software 


By the end of this section you should be able to: 

¢ describe what an operating system is and how it manages files, processes, hardware and the user interface. 

° describe the purpose and functions of utility software (managing, repairing and converting files; compression; 
defragmentation; backing up; anti-virus, anti-spyware). 

¢ describe how software can be used to simulate and model aspects of the real world. 


Kenems 


| Software: the set of programs run | 


by a computer system. 


| Application software: software 
that performs a task that would 
otherwise be done by hand, 
perhaps with pen and paper. 


| Operating system: software 
! designed for particular hardware 
and which manages other 


programs’ access to the hardware. 


Utility software: software that 
does a useful job for the user that 
is not essential to the operating 
system and not the reason for 


using a computer in the first place. 


| 


Computer systems consist of both hardware and software. Software 

is the set of programs that run on a computer system. You can usefully 
divide software into two types. The first type is what you normally think 

of as programs, those that do a job for us that would otherwise have to be 
done by hand, such as payroll calculations. This is application software. 
The other kind of software is there to help us use the computer and the 
application programs, such as providing the user interface and tools to 
manage the hardware. This is system software and it is usually divided into 
operating systems and utility software. 


Operating systems 
The operating system (OS) is usually loaded when the computer starts up. 


You will probably already know several operating systems, for example 
Microsoft® Windows®, Linux, Android, iOS etc. 


Why do you need this software? Unlike embedded systems, a general- 
purpose computer can be used to do a range of jobs. Software written 
for general-purpose computers needs to be able to run on a range of 
different hardware, for example your favourite web browser runs on your 
laptop or desktop computer and also on your mobile phone or tablet. 
One of the jobs of an operating system is to manage whatever input and 
output devices the computer has. A programmer writing an application 
uses simple commands, say OpenFile, and the operating system deals 
with exactly how to open a file. This will depend on what sort of secondary 
storage the file happens to be on. You might have realised this is yet 
another example of abstraction. 


This system means applications have to be written for a particular 
operating system but will work on any hardware. You will probably already 
be familiar with this idea because you know, for example, that Android 
apps run on any Android device. Many different models of smartphone or 
tablet will run the app, but the same apps won't run on your friend’s 

iOS phone. 


160 


Figure 4.13 The operating system sits between the hardware and the programs 


The operating system provides developers of application programs 

like word processors with a set of commands such as OpenFile or 
Display)PEG: this set of commands is usually called an application 
programming interface or API. Using the operating system API rather 
than accessing hardware directly means that an application program 
will run on the operating system no matter what hardware it is installed 
on. It also means that the operating system controls what application 
programs are allowed to do. This helps to reduce the risk of programs 
crashing and to maintain security. 


Managing files and the directory structure, and input/output in general, is 

one of the important jobs of the operating system. An operating system also 
shares access to the hardware among the different programs that are running 
(and on some operating systems among the different users who are logged on 
to the computer). Two important pieces of hardware that have to be shared 
are the CPU and RAM. 


e insection 4.2 you learned that a von Neumann system does only one thing 
at a time. When you have several applications open they can’t really be 
running simultaneously. The operating system keeps up this illusion by 
allowing each program to use the CPU for a short time before switching to 
the next program. This is called scheduling. 


¢ It is possible to load many applications all at the same time and each 
uses some memory. What if you keep loading applications until RAM is 
full? In this case the operating system creates another illusion, by moving 
programs from RAM to disk and back again when needed. This is called 
paging, swapping or virtual memory. You might notice this as a lot of disk 
activity and your computer running more slowly when lots of applications 
are open. 
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| Scheduling: the algorithm that 


, the OS uses to allow each running | 


_ process to use the CPU. 


' Paging: the algorithm the OS uses 
to move programs from RAM to 
disk and back again when needed 


once main memory is full. 


One of the functions of an 
operating system is scheduling. 


1 Describe the purpose of 
scheduling. (2 marks) 

2 Give three other functions of 
an operating system. 
(3 marks) 


The command word in Question 1 
is ‘describe’ and two marks are 
allocated to the question, so 
make sure you make two distinct 
points in your answer. 


Concurrent: processes that run 
apparently at the same time are 
described as being concurrent. 


Authentication: the process of 
proving to a computer system 
who you are (€.g. using a 
username and password). 


User interface: the way the 
user interacts with the operating 
system 


As well as the applications you are running, the operating system is also 
running other programs in the background. They include various parts of the 
operating system itself such as the programs that manage the network, the 
scheduler and also any hardware drivers and utility programs such as anti- 
virus. Each task that is running is called a process. To see all the processes that 
are running, including the invisible ones 


* on Microsoft® Windows®, press CTRL-ALT-DEL and choose Task Manager. 
* on Linux, type ps aux at the command prompt. 
* on OSX, run the Activity Monitor from Applications/Utilities. 


Information about how much the hardware is used by each process is 
displayed, including. 


¢ the proportion of the CPU time spent on this process. 
* the amount of memory used by this process. 
* the amount of available memory (real memory and virtual memory, VM). 


These processes are running one at a time, being given their share of the CPU 
time by the scheduler. They are all running alongside one another, but not 
really at the same time, so we say they are running concurrently. An operating 
system that allows processes to run concurrently is called a multi-tasking 
operating system. Almost all modern operating systems are multi-tasking. 


As well as allowing multi-tasking, some operating systems allow different 
users to use the hardware. In some cases only one user can use the 
hardware at once (single user), whereas in others several users can be 
sharing the hardware (multi-user). Your smartphone is an example of a 
single user system, whereas your school network servers will be running a 
multi-user operating system so that many of you can log on at the same 
time. In a multi-user system, each user has to identify themselves (with a 
username, fingerprint etc.) and provide a password (or other log in) to prove 
they are who they say they are - this is called authentication. Authority 
to use the facilities of the computer system is based on who you are, for 
example learner users might not be able to print, whereas tutor users 
can. In such systems there is usually also a super-user (sometimes called 
administrator or root) who can access all resources and also control the 
access rights of other users. 


The operating system also provides and runs the user interface. Sometimes 
this is a graphical user interface (GUI) with windows, icons, menus and 
pointers (WIMP) like Microsoft® Windows® or MacOS, or sometimes it is just a 
text, command-line interface (CLI) like MS DOS. 


The choice of user interface depends on the applications a computer is 
expected to run. An embedded system might not have an operating system 
as such, and therefore does not have a user interface. A computer acting 

as a web server might only need a command-line interface because it is 
intended only ever to be managed by technicians. A computer used for 
video-editing will almost certainly need a graphical user interface. 
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In common with many school subjects, as you learn more about computer 
science you will discover that some details were missed out so that you 
could understand the key ideas first. At least in computer science you are 
used to this idea of abstraction. One such detail is the idea of a driver. 

The driver for a device is the software that really communicates with the 
I/O device on the computer's bus. It transfers data between the hardware 
device and the operating system in an agreed format — also known as the 
application programming interface (API). To find out more about drivers 
and operating systems search online for ‘NDIS’ — this is the API that is used 
for Microsoft® Windows® network drivers — or ‘ALSA’, the API for sound 
drivers on Linux. 


An API is yet another example of abstraction. A software developer deals 
with the essential tasks like OpenFile, but the detail of how data is taken 
from secondary storage is hidden, managed by the operating system. 


| Activity 19 


*—J 


| 1 Run/open the program that lists the running processes on an 

operating system you are familiar with. Make a table listing the 

| application software, utility software and operating system processes 

that are running. 

2 There are several different algorithms that operating systems can use 
for scheduling. The one described on page 161 is called round-robin 
scheduling. Research and describe at least one other method used for 
scheduling. 

3. Take one of the programs you have already written and add a feature 


Utility software 
Utility software is software that does a useful job for the user. It is 
¢ inessential to the operating system, and 


e not the reason for using a computer in the first place (that is, it is not 
application software). 


Utility software, sometimes called tools, can be split into three areas: 

e basic tools; 

e file management; 

¢ security. 

Basic tools 

Basic tools include things like a simple text editor (e.g. Notepad or nano), 
calculator, command prompt, hex editor and software for accessibility 
such as for producing large print. They are usually included as part of the 
operating system. 
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Activity 20 ? 


_as ‘file management tools’. Pair 


Make a list of all the utility 
software you can find on your 
computer. Try to organise your 
list into groups or categories such 


with a partner and share your list. 
_ Make a ‘problems’ list of software 
that you cannot agree a category 
for. Finally pair with another 
couple of learners and compare 


' their problem list with yours. 


Back-up: a copy of files in another 
location so that they are still 
available if the original copy is 
damaged or lost. Backing up is the 
process of making a back-up copy. 


Defragmenter: a utility that 

moves file clusters onadiskso | 
they are closer to each other in | 
order to speed up disk access. | 


Virus: software that is designed 
to make and distribute copies 
of itself, usually for a malicious 
purpose. 


Spyware: software, possibly 

a virus, that is designed to be 
installed secretly on a computer 
and record private information as 
the user enters it. 

Firewall: a utility that controls 
program access to the network, 
both incoming and outgoing. 


Be careful not to confuse 
compression with encryption. 


File management tools 

No matter how much storage you have, somehow you always manage to 
fill up the space with files. This brings problems such as not being able to 
find your favourite photos, losing files if there is a fault and things simply 
running slower on your computer. 


File management tools include software to keep your data secure by making 
a back-up copy of your files in another location. Some programs 

can recover files that have been deleted or repair files that might have 
become corrupted. 


Another useful file management tool is for converting files between 
different formats. This is especially useful for multimedia files where there 
are a large number of common formats (e.g. jpg, gif, png). 


Another important file management tool is a defragmenter which is used 
to speed up access to data stored on magnetic hard disk drives. Files are not 
stored on disk in one large chunk. They are stored in smaller pieces called 
sectors or clusters and a single file’s clusters can be spread across the disk. 
Accessing these fragmented files can be slow, so a defragmenter moves 

the bits of files around so they are closer together, speeding up disk access 
(at least in theory). Note that, unlike magnetic disk drives, solid state/flash 
drives do not require defragmentation as access times are the same across 
the entire volume. 


You can also use a file management tool to compress the flles you don’t use 
very often. This frees up space on the disk but those compressed files are 
then slower to access as they have to be uncompressed to open them. 


Security tools 


Security tools include anti-virus and anti-spyware software. Computer 
viruses and spyware can be used to capture private information, to 
introduce a process on your computer to do a job for the attacker such 

as send out spam email, or to crash an important machine such as a web 
server. Some are simply written to annoy users. Such software is called a 
virus because it copies itself and spreads: anti-virus tools try to identify code 
that copies itself and prevent those processes from running. 


Firewall software controls your connection to a network, deciding which 
network data can flow to and from your computer. This can help to prevent 
infection by viruses. 


Encryption software is used to make files unreadable without a password. 
Data on your tutor’s laptop might be encrypted so that it is not 
understandable if the laptop is stolen. 
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Simulation and modelling 


In Chapter 1 you learned about how you can use abstraction to model the 
real world. By now you will be getting used to the idea of abstraction as a 
central part of computer science. It means hiding complexity, only exposing 
the key or important features of a situation and leaving out the details. 


One important application for computer science is the ability to do 
experiments that can’t or shouldn't be done in the real world. For example, 


* you can't increase the average temperature of the Earth to see what 
happens to our weather; 


* you shouldn't stop one of a pair of twins from exercising to see if it causes 
depression. 


You can, however, write a computer program to model or simulate these 
situations and see what results you obtain. Scientists, engineers, economists 
and other experts can ask ‘what if?’ questions using computer models, for 
example, what if the bridge were twice as long? 


There are two big problems with this approach. 


¢ The model or simulation includes assumptions. It is not reality so the 
answers might not be right. 


e The real world is far too complicated to allow for every possible factor in 
your model, so you have to use abstraction to simplify it. This makes it 
even less likely that you will get a totally right answer. 


Nevertheless, computer models can reveal some interesting things about 
how the world works or about how humans work with machines. 


Examples of computer models 

A flight simulator allows pilots to train to fly aircraft safely. They can make 
mistakes in the simulator without endangering their own lives or the lives 
of passengers. Commercial flight simulators use a moveable cabin and real 
flight controls so pilots can see, hear and feel things just as if they were 
flying a real aircraft. 


If you ever watch the weather forecast you will sometimes hear the presenter 
talking about their computer and sometimes even about computer models. 
The atmospheric models used to forecast the weather are some of the most 
complex pieces of software that exist. They split the atmosphere up into small 
pieces and keep track of the temperature, pressure, humidity and other factors 
in each piece. They start the model with real measurements, then they let it 
calculate the effect each piece of atmosphere has on its neighbours and see 
how the weather patterns develop. 


Another example of a computer model is used for earthquake prediction. 
Unlike the previous examples, we don’t know the mathematics behind how 
earthquakes work. Models like these have to use information from past 
examples and probability to come up with best guesses as to what might 
happen and when. Some common techniques used in such models are 
heuristics, Monte Carlo methods and neural networks. There is a whole 
field of computer science devoted to finding patterns in data and using 
them to predict the future: this is machine learning. 
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‘What if?’ question: running a 
computer model with a given set 
of inputs to see what the model 
produces as an output or prediction. 


Heuristic: a type of algorithm 
capable of finding a solution to 
a problem quickly and easily, by 
using a combination of trial and 
error and educated guesswork 


- to cut corners and eliminate 


less likely alternatives. Heuristic 
algorithms don't always find the 
best solution but they will usually 
find one that works. 


Monte Carlo methods: carrying 
out a Statistical analysis of a 
number of random samples in 
order to obtain approximate | 
solutions to a problem. The larger 
the number of samples used, the 
more accurate the result is likely 

to be. 


Neural networks: processing 
information in a similar way to 
human brains and learning and 
adapting over time. This makes 
them useful tools for solving 
pattern recognition problems, 
such as facial recognition, 
medical diagnosis and quality 
control, which computers are 
normally not very good at. 


Activity 21 


1 Using a programming taneuage that you are e learning, 
a write a simulation of rolling a die. The die must come up with 1 to 6 with equal probability. 
b adapt your program so that the die is biased to give 6 more often than it should. 


2 In agroup, research techniques used in machine learning and illustrate your findings using presentation 
software. Bring your talks together and present them back to the class. 


¢ An operating system (OS) is software that 
O manages a hardware device; 
O manages users’ access to the I/O devices, including files on secondary storage; 
Oo provides the user interface; 
o shares the computing resources such as CPU and memory between processes. 


¢ Processes are tasks running on the computer, including application programs. 
¢ Utility software does useful jobs for the user, such as file management. 
* Computer modelling enables experts to ask ‘what if?’ questions without affecting the real world. 


* Computer models are very useful, but contain assumptions and simplifications so you can't rely completely 
on their results. 


en. 


Sachi i ae peep Beano. is RE SI SS PSV oe NCI 


S1 Name one operating system =r list aye main areas it penrerames 


S2 Describe the differences between command-line and graphical user interfaces. 
$3 Explain why anti-virus software is utility software. 


S4 Computer models suggest that if we continue burning fossil fuels at the current rate, average sea levels will rise 
25cm by 2050. Disiis ihe aduantages alta Bi eaibiees of using a computer model for this problem. 


Challenge © —--- a —_ ar 


C1 State what is meant nonin a process eset eH the purpose aE re heat 


PLONE ag os 0) 


C2 Suggest why some users might prefer a command-line interface to a graphical one. 


C3 Write a program to model the populations of rabbits and foxes in a habitat. Use the following 
information to help you. 
Foxes eat rabbits 
Foxes die of starvation if they don’t eat enough rabbits 
Rabbits and foxes both reproduce, but rabbits reproduce faster 
Rabbits and foxes both die of other causes 


How confident are you in your answers to these questions? If you are not 
Sure that you answered them well, make a revision resource by researching 


operating systems, utility software and computer models online before 
attempting the questions again. 
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4.5 Programming languages 


By the end of this section you should be able to: 


* explain what is meant by high-level and low-level programming languages, and describe their suitability for a 
particular task. 


* explain what is meant by a compiler and an interpreter and give the advantages and disadvantages of each. 


Low-level programming language 

You learned in section 4.2 that a von Neumann architecture computer uses 
the stored program concept. The instructions that the computer carries 
out, the program, are stored in memory just like any other kind of data. This 
means that programs are also just binary numbers — that’s all you can store 
in memory, after all. So how can binary numbers represent a program? 


One other thing you learned in section 4.2 is that the CPU can only doa 
few very simple things. It only ‘understands’ a few very simple instructions. 
Complex things like playing games emerge from having a really large 
number of those simple things happening very fast. Those few simple 
instructions that a CPU knows how to do are called the instruction set 

for that type of CPU. Each instruction is given a binary code and it is these 
codes that make up your programs. The binary codes representing a 
program are called machine code. 


' Instruction set: the list of all 
possible commands a particular 
. CPU knows how to carry out. 


A machine code program would look something like the diagram below 
— you can probably see why programming in machine code would be 
completely impractical. 

Machine code: the binary 


0110100100100101010101011111010101100101011010110101010! = codes representing each ofthe | 
instructions in the instruction set. 


Figure 4.14 An example of a machine code program Translator: a program that 


' converts source code into ! 
Not only would it be incredibly difficult to work out the sequence of 1s and machine code. 
Os you need, but also your program would only work on that kind of CPU 
because different CPUs have different instruction sets. Instead, you need 
a translator which converts your programs into the CPU’s machine code 


Source code: the text of the 
program that a programmer 


writes. 
language. 
Assembly language: a low- 
Translators are also programs. Their input is the text of your program - this level language written using 
is called your source code. Their output is machine code which the CPU can rangnnion Ge 


UMTS Sule Low-level programming 


The simplest translator is called an assembler. It converts assembly language language: a programming 
to machine code. Assembly language is called a low-level programming language that is closely related to 
language. This is because the programmer is working at the lowest level of | the CPU’s machine code. 
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Mnemonic: a short, simple, 

acronym that represents each 

of the instructions in a CPU’s 

instruction set, e.g. LDR (load 

register), STR (store) and CMP 
' (compare). 


Make a list of tasks for which 
assembly language is better 
suited than a high-level 
programming language. 


detail, the level of the CPU hardware itself. Each instruction in assembly 
language is the same as a machine code instruction. 


Assembly language is a bit easier to work with than machine code 
because each instruction is written as a short, memorable keyword called 
a mnemonic. For example, the instruction for the CPU to add numbers 
together might be 01101001 in machine code, but is just ADD in assembly 
language. The assembler replaces each mnemonic with the appropriate 
binary machine code and data. 


LDR R3, [R1] 
MOV R2, #0 
CMP R3, RO 
MOVGT R2, #1 


An example of assembly code 


The 1980s space-trading game Elite, which featured 3D graphics and a 
galaxy of hundreds of planets to explore, was written entirely in assembly 
language and occupied just 22 KB. To see what an amazing feat of 
programming that was, look at the file size of a modern 3D graphics game. 
To learn more you could play an authentic version of the game on an 
emulator — search online for ‘BeebEm’. 


Writing programs in assembly language is challenging for three 
reasons. 


¢ Avery limited range of instructions is available. Every task, even the 
simplest, has to be built up from the smallest steps — some older 
CPUs could not even multiply numbers. 


* You have to manage all your data. There are no strings, integers or 
real numbers, just binary, so you have to decide how to represent 
your data. You also have to decide and manage where it is stored in 
memory. 


* Debugging is very difficult. When the assembled program runs, any 
bugs usually just make the machine crash and you have to reboot to 
try again. 


Nowadays almost no programs are written in assembly language. The 
exceptions are some hardware drivers and programs designed to run 
on embedded systems that you learned about in section 4.2. This is 
because writing in the CPU's home language can be useful if you want 


your program to be very small and very fast or to directly control the 
I/O hardware. 
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Extend your knowledge 


Research the Little Man Computer simulation, Make a list of the machine 
code instructions that Little Man Computer ‘understands’, 


High-level languages 
It is much more common to write software in high-level programming 


languages such as Python, C, Java or Visual Basic. For these programs to run 
on the CPU the source code has to be translated into machine code. This can 


_ Compiler: a translator that 
_ converts high-level language 
' source code into object code, 


be done 


* all at once, and the finished machine code program saved and run later; 


¢ one line at a time. 


Compilers and interpreters 


A translator that translates the whole program in one go is called a 
compiler and one that translates and runs your program one line at a time 


is called an interpreter. 


A compiler converts high-level language to machine code and saves the 
Output as a machine code program, sometimes called object code. This 
usually has the .exe file extension, short for ‘executable’, because the 


machine code can be executed or run. 


An interpreter translates your source code one line at a time and runs it 
there and then. There is no object code file, only your original source code. 


Every computer that will run your 
program needs the interpreter 
software installed. 


Interpreters find errors when they 
happen and can often tell you 
what has gone wrong. 


Programs tend to run slower 
using an interpreter because the 
interpreter has to translate the 
source code while the program 
is running. 


The output from a compiler will 
run on its own on any similar 
computer. 


A compiler cannot produce 

any object code unless the whole 
program is correct — they tend 

to report a lot of errors initially, 
making it harder to debug your 
program. 


It can be easier to protect your 

code from being altered or copied 

if it has been compiled because you | 
only give people the object code | 
(machine code), which is hard 

to understand. 
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often machine code. The source 
code is translated all at once and 
saved to be executed later. 


Interpreter: a translator that 
converts high-level language 
source code into object code, 
often machine code. The source 
code is translated and executed 
one line at a time. 


Object code: the translated 
source code. Often this will be 
machine code, but might also be 
an intermediate code, which has 
to be further translated before it | 
can be executed. 


Activity 23 ? 


Implement one of the algorithms 
you designed in Chapter 1 

in a graphical programming 
language, such as Scratch, and a 
textual language, such as Python. 


Which language did you find 
most intuitive, which was easiest 
to use? Which one would you 
recommend for someone starting 
to program? 


Sasa 


Manjit is using a high-level programming language to develop a computer The command word in Question 2 
game for children. | is ‘compare’, so you need to talk 
about similarities and differences 
between a compiler and an 
interpreter. Make sure your 
answer relates to both. 


1 Explain why the program code she writes will need to be translated 
before it can be executed by a computer. (2 marks) 


2 Compare using a compiler with using an interpreter to translate the 
code. (4 marks) 


e CPUs can only run programs written in machine code. 
¢ Atranslator is a program that converts source code into machine code. 


* Humans program computers mostly using a range of high-level languages, but sometimes in assembly language (a 
low-level language). 


¢ Compilers and interpreters are types of translator, each having their own advantages and disadvantages. 


Stengel eee a 

$1 Explain what is meant by machine code. 

$2 What is the job of a translator? 

S3 Momina is a C programmer and uses a compiler. Denis is a Python programmer and uses an interpreter. What 
challenges will Momina face in developing software compared with Denis? 


Challenge | 
C1 Explain the difference between high-level and low-level programming languages. 
C2 Evaluate the choice of translator between compilers and interpreters. 


How confident are you in your answers to these questions? If you are not 
sure that you answered them well, you could review the chapter by 
searching online. 


170 


Chapter 5 


Communication and the internet 
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5.1 Networks 


Learning outcomes hh 


By the end of this section you should be able to: 


explain why computers are connected in a network. 

describe the different types of networks and usage models. 

describe the characteristics of network topologies. 

describe wired and wireless connectivity. 

explain that network data speeds are measured in bits per second. 

describe the role and need for network protocols. 

explain that data can be transmitted in packets using layered protocol stacks. 


Computer networks and their use 


sKeyiterm = Computer networks 


Network: an arrangement of 
computers and other devices 
connected together to share 
resources and data. 


Internet-connected fridges 


A network is a collection of two or more computers that are connected 
together for the purpose of sharing resources and data. All types of 
computer (e.g. desktops, laptops, tablets, e-readers, gaming systems, shop 
tills and even internet-enabled fridges) can be linked in networks. 


Many networks include servers. A server is a powerful computer that 
provides the network with a service, such as storing files or sending/ 
receiving emails. A small network might have one server, whereas a large 
business network could have tens or hundreds of servers. 


Why are networks used? 


A network can support multiple users accessing multiple services 
simultaneously. 


are already available. They 


can provide details of what Almost all organisations (small or large), including schools, have a computer 

fam chide theniancicenc network that provides multiple services to its users. The network allows 

reminders to your smartphone multiple users: 

or tablet when you need to ¢ to read/write personal files on a central server — this provides extra 

restock items. storage space to that on a single computer and also gives a back-up 
facility; 


* to access shared files among several users; 

* to download data or updates to computer programs; 
* to send data to a shared printer; 

¢ to access the internet; 


* to communicate with each other — perhaps through email or video, or to 
play interactive games. 


Wire 


| Carry out the necessary research, then create a leaflet that explains to Local area network (LAN): a 


' new learners the following. 


network that covers a relatively 


1 What a network is and what kinds of devices can be connected to it. small geographical area, often a 


single site. 


| 2 What services your school network provides and how to access them. . 


boy : What services a home network typically provides. 


Different types of network 
LAN and WAN 


. Wireless local area network 
(WLAN): a local area network 
in which connected devices use 
high frequency radio waves to 
communicate. 


Wide area network (WAN): 


A local area network (LAN) is a network that covers a relatively small a network that covers a large 
geographical area. This is often a single site, such as a home, a hospital or | geographical area. It connects 
a factory. ' together two or more LANs 

and is usually under collective 
The hardware (e.g. cables, routers, etc.) that connects the computers, servers ownership. The largest wide area 
and other hardware devices are usually owned by the organisation that the network is the internet. 


network belongs to. 


Many people have a wireless local 
area network (WLAN) at home. 

A wireless router allows all the 
computer devices in a household 
to access the internet and share 
devices such as printers and 
external hard drives. 


A wide area network (WAN) 
covers a large geographical area, 
usually across several sites of an 
organisation. Each site has one 
or more LANs, and they are all 


connected together to make a WAN. 


The WAN allows employees on 
different sites to communicate and 
share data. 


—— 
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Figure 5.1 LANs may be connected together to form a WAN 


ve 


A bank wants to share data 
between its head office and its 
400 branches. Explain why the 
bank would use a WAN rather 
than a LAN for this purpose. 
(2 marks) 


This question tests your 
understanding of the differences 
between a LAN and a WAN. A 
good answer should explain the 
difference between a LAN and a 
WAN and then explain why the 
network used by the bank to 
share data between head office 
and the branches would need to 
be a WAN. 


Client-server network: a 
network that has at least one 
server to provide services to the 
client computers. 


Peer-to-peer network: a 
network that doesn’t have 

any centralised servers. Each 
computer in the network can act 
as client and server. 


Client-server and peer-to-peer networks 
There are two main models relating to computer networks — client-server 
and peer-to-peer. 


Client-server 

In a client-server network there is at least one server, which is a powerful 
computer that provides a service or services to the network. Individual computers 
(such as the ones in a typical school computer room) are referred to as the client 
computers. The server will authenticate the user (see section 5.2) and then give 
the user access to the files that he/she has been given permission to access. 


The server provides services to the clients as required. 
A client accesses data or files from a server using the following process: 


1 Actient will make a connection to the server using its address. The server 
will know the address of the client because this will be included in the 
request for a connection. 


2 Once the connection has been made, the client will make a service 
request to the server. 


3 If the request is valid, the server will send the requested data to the 
client using the address identified in step 1. 


Sometimes additional steps, such as authentication, are needed, 
depending on what the client is trying to access. For example, when Chris 
wants to check his web-based email from his laptop he will use a web 
browser (the client) to access the website, which is hosted on a server. 
The website will serve a web page that asks for his username (his email 
address) and his password. 


He will enter his username/password and then submit them to the server. 
If the details are correct he will be authenticated and the server serves 
back the web page that displays his emails and allows him to manage his 
email. 


Peer-to-peer 


Unlike in a client-server network, there aren't any centralised servers ina 
peer-to-peer network. Each computer can act as both a client and a server. 
Each computer.in a peer-to-peer network can provide a service, such as 
share some files or provide access to a printer. Each computer can also 
request services from any other computer that has been configured to 
provide that service. 


Some instant messaging systems use a peer-to-peer service to allow users 
to chat in real time. There is no central server that you sign on to and your 
messages are sent directly to the recipient. 
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Some peer-to-peer networks connect using the internet. These 
particular peer-to-peer networks are often used for sharing files, 
especially pirated movies, music and computer games. It is difficult 
to find out who is sharing files when they are distributed using peer- 


to-peer networks because no records of who downloaded what are 
normally kept. 


There are also legal uses of internet-based peer-to-peer file sharing, 
for example some versions of Linux are only downloadable using peer- 


to-peer file sharing. The lack of records also benefits those people 
worried about privacy. 


Network topologies 


There are several different ways that the connections between networked 
devices can be arranged. The arrangement of these connections is referred 
to as the network topology. 


There are four main network topologies — bus, ring, star and mesh. 


Bus 


Computer Compuier 


Terminator | 
Computer 


Printer 


Figure 5.2 Bus network topology 


A bus network consists of a single cable to which each networked device 
is connected. Messages are sent along the cable in the form of electronic 
signals. At each end of the cable is a terminator. 
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Activity 2 


Create a table comparing a 
client-server network with 

a peer-to-peer network. Use 
one column for the client- 
server network and another 
for the peer-to-peer network. 


Research why internet-based 
peer-to-peer networks are a 
popular way of distributing 
pirated material such as 
movies or music. Present your 
findings as a short report. 


Network topology: describes 
how the devices on a network are 
_ connected together. 


Server 


Terminator 


i) 


Computer 


The purpose of a terminator is to absorb signals that have reached the end 
of the cable, preventing them from bouncing back and causing interference. 


As a bus network only has one cable (the bus), only one message can be sent 
at any one time. A bus network needs a system to deal with multiple devices 
sending a message at the same time. The most common method is called 
Carrier Sense Multiple Access with Collision Detection (which is shortened 

to CSMA/CD). A collision occurs when two or more network devices send a 
message at the same time, making all the messages unreadable. 


A simplified CSMA/CD sending algorithm works like this. 


Check if bus is busy. 
If not busy then send message, else go to step 1. 
Listen to see if message received correctly. 


fr WN 


If message not received correctly then go to step 5, else go back to 
listening for messages. 


5 Wait random amount of time, go to step 1 to retry sending message. 


Making each device wait a random amount of time (a small number of 
milliseconds) before checking whether it is safe to send prevents the 
competing senders from repeatedly sending at the same time. 


The bus topology isn’t widely used in business computer networks any 
more; however, it is used in many modern cars and some aircraft to 

allow all the different systems within the vehicle to communicate. The 
most well-known standard is called CAN (Controller Area Network) bus. 


Algorithms can be represented in many different ways. Improve your skills 
and understanding by drawing a flowchart (see section 1.1 of Chapter 1, 
page 3) that shows how the CSMA/CD algorithm works. 


Advantages and disadvantages of the bus topology 


Relatively cheap to install since only 
one cable is needed 


Easy to add extra network devices 


Whole network will fail if the cable is cut or damaged 


Can be difficult to identify where a fault is on the cable 


The more devices that are added to a bus network the slower they 


run. This is due to only one message being able to be sent at once and 
because more collisons happen 


All data sent is received by all devices on the network; this is a security risk 
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Ring 


Computer lf 
p 2 Server 


Computer 


Printer 


Computer y 4 Computer 


Figure 5.3 Ring network topology 


A ring network is a network in which the cable connects one network device to 
another in a closed loop, or ring. Each network device has what can be thought 
of as an ‘in’ and an ‘out’ connection. 


Messages sent on a ring network all travel in the same direction and, unlike 
a bus network, there are no collisions. Data is passed from one device to the 
next around the ring until it reaches its destination. 


Advantages and disadvantages of the ring topology 


Adding extra devices does not affect the performance Whole network will fail if the cable is cut or damaged or a 
of the network device on the network fails 


Easy to add extra network devices Because all the devices in the network are connected 
in a closed loop, adding or removing a device involves 
shutting down the network temporarily 


Can be difficult to identify where a fault is on the network 


More expensive to install than a bus network as it 
requires more cable to complete the ring 
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Figure 5.4 Star network topology 


In a star topology each network device is connected to a central point. This 
central point will be either a hub or a switch. A central hub receives and 
directs messages to the correct recipients. The star topology is the most 
widely used, but it does require a lot of cabling because each device is 
connected to the central point. 


Advantages and disadvantages of the star topology 


A damaged cable will not stop the whole network If the hub or switch fails then the whole network will fail 
from working, just the network device connected 
to it 


If a switch is used, then the network is efficient as Expensive to install due to amount of cable needed and the 
messages are only sent to devices needing them. ' hub or switch 

This also adds to security as not all devices will see 

a message 


Easy to locate faults because they will normally only 
involve one device 


A new device can be added or removed without 
having to close the network down 
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Mesh 
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Partially connected 


Computer 
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Figure 5.5 Mesh network topology showing fully connected and partially connected types 


There are two main types of mesh topology - fully connected and partially 
connected. In a fully connected mesh network every network device is 
connected to every other network device. In a partially connected mesh 
network some network devices may be connected to multiple other devices, 
but others might only be connected to one other device. Each device in a 
mesh network will pass messages on to other devices within the network. 


Mesh networks can be wired or wireless. In a wired network the number 
of connections needed for a fully connected network saon becomes very 
expensive and difficult to implement as the number of network devices 
increases. Wireless mesh networks don't have the same problems. 


Mesh networks are very fault tolerant. If a device or connection fails, 
messages are simply routed around it. In a fully connected mesh network a 
large number of failures could happen and the rest of the network devices 
would still be able to communicate. The ability of a partially connected mesh 
network to carry on working correctly will depend on how many connections 
to each device are available. 


The largest mesh network of all is the internet. 


Mesh networks are used to communicate between sensors in the ‘internet of 
things’, especially if the devices are mobile, because the optimum network 
paths can be used no matter where a sensor is located. 
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Internet: a worldwide system 
of interconnected networks 
| that enables information to be 
| exchanged and shared. 


Advantages and disadvantages of the mesh topology 


Very fault tolerant, especially in the case of a fully Difficult and expensive to install wired mesh networks 
connected mesh network - if one device fails, messages 


can be rerouted 


Very high performance because each network device is Can be difficult to manage due to number of 
likely to be connected to multiple other devices connections within the network 


In a wireless mesh network each node extends the range 


of the network 


Communication media: 

| the means by which data is 

| transmitted between devices on 
a network. Coaxial cable, fibre- 
optic cable and microwaves are all 
forms of communication media. 


A small business has a network consisting of three computers, one printer 
and a server. The network uses a star topology. Draw a diagram of the 
network and label all parts. (4 marks) 


¢ This question tests your knowledge of network topologies and the 
components within them. 


¢ A good answer will show the network cables in the correct layout, 
any required network hardware and the correct items connected 
to the network. The diagram will need to be labelled to show what 
each item is. 


Activity : ie 


1 Close this book and draw an annotated diagram for each of the 
following topologies: ring, star, bus and mesh. 


2 From memory, list the advantages and disadvantages for each 
network topology you drew in Question 1. 

3 Research which of the four network topologies in Question 1 would 
be most suitable for providing local access to the internet in a 
country that doesn’t have reliable wired communication links. Write a 
paragraph explaining your decision and how the system could work. 


Communication media 


There are many different ways that computers can be connected together to 
form a network. Networks are classified as either wired or wireless according 
to the transmission or communication media used. 


Wired 


Wired connection methods involve a physical connection between the 
computer and the network. Most wired connections are made of copper 
wire but they could also be fibre-optic cable, which is made of either glass 
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or plastic. Copper wire carries electrical signals, whereas fibre-optic cable 
carries light signals. Fibre-optic cables transmit data far faster and allow 
signals to travel over greater distances than copper wire. 


Wireless 


Wireless connectivity does not require a physical connection between 
devices. Most wireless connections transmit and receive radio signals, but 
other connection methods such as infra-red light or microwaves can be used 


over limited distances. 


Important methods of wireless communication are the mobile phone 


network, Bluetooth® and Wi-Fi®. 


Extend your knowledge ‘ 


Many TV remote controls use 
infra-red light to send signals 
to control the volume or 
change the channel. 


The internet of things is a term that covers the growing range of 


physical items that can be connected to the internet. These physical 
items (the ‘things’) can report details about themselves and be 
controlled over the internet. So your central heating could report the 
temperature of your house and allow you to adjust the thermostat 
from any internet-connected device, including your mobile phone. 


ZigBee is an open-source protocol for allowing devices to be 
controlled by using low-power digital radio transmitters/receivers that 


are embedded within the device. 


Wired or wireless connectivity? 


There are advantages and disadvantages to both wired and wireless 


connection methods. 


Protocol: a set of rules that 
govern how communications ona 
network should be formatted and 
what data they should include. 


Eavesdrop: having unauthorised 
sight of data being sent from 

one computer to another over 

a network. This is covered more 
fully in the later section on 

~ technical weaknesses (section 5.2, 
page 189). 


Wired connectivity 


Faster than wireless connectivity 


Not easy to intercept or eavesdrop on data 


Less susceptible to interference than wireless 
connectivity 


Wireless connectivity 


No need for a cable to connect devices or to the internet 
Allows users to use their own device 


A wider range of devices can communicate with each 
other/a network because it is not dependent on having 
the correct cable 
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Expensive to install and reconfigure 


Requires many cables at a premises 


Data transmission speeds can be slower than wired 
connectivity 


interference from other wireless devices can adversely 
affect performance 


Walls and other physical objects can adversely affect 
performance 


Data needs to be encrypted (see section 3.4) to 
prevent eavesdropping or interception 


, Bandwidth: the amount of data 
that can be carried on a network 
in a given period of time. 


Network data speed 
Being able to measure the speed of a network allows you to: 
¢ determine whether an equipment upgrade is necessary; 


¢ estimate how long it will take to download a file; 


© determine whether actual performance lives up to the promises made by 
the service provider. 


The speed that data can be transmitted through a communications medium 
is measured in bits per second (this can shortened to bps, b/s or bit/s). 
Modern high-speed networks can transmit billions of bits per second. 


bits per second bps 1 

kilobits per second kbps 1,000 
megabits per second Mbps 1,000,000 
gigabits per second Gbps 1,000,000,000 


Although the units listed above have similar names to the units used to 
measure storage (such as the capacity of a hard drive or the amount of 
RAM a computer has), they are different and you should make sure not to 
confuse them. 


For example, you might measure storage in kilobytes (see Chapter 3, 
section 3.3, page 117); however, you measure the speed of a network 
connection in kilobits per second. 


Calculating the time needed to transmit a file 


If you have a network connection of 12 Mbps and you want to 
download a 50 MB file, how long would it take? 


To find out: 

1 Convert the file size from megabytes to bits. 

50 MB x 1024 x 1024 x 8 = 419,430,400 bits 

2 Convert the transmission speed from Mbps to bits per second. 
12 x 1000 x 1000 = 12,000,000 bps 

3 Divide bits by bits per second 

419,430,400/12,000,000 

Time = 34.95 seconds 


These times are only theoretical as transmission speeds are not constant 
and other factors affect the speed of data transmission. 


The amount of data that can be carried from one point to another ona 
network in a given period of time is referred to as its bandwidth. Bandwidth 
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is usually expressed as bits per second (bps). Another important factor is the 
latency of the network connection. Latency refers to any kind of delay that 
data travelling through a network might encounter. A low latency network 


connection is one where any delays are small and a high latency network is 
one where data suffers from long delays. 


1 Find out how fast your school, home and mobile phone internet 
connections are. Compare these, ensuring you use the correct units 
(kbps, Mbps or Gbps). Make a note of your findings. 

2 Work out how long it would take to download a 300 MB file using 
your home and school networks. 


. Extend your knowledge 


A personal area network (PAN) is a network that is used to connect 
devices belonging to an individual person. These devices might include 
a mobile phone, a laptop, a tablet or a smartwatch. The PAN can be 
used to share data such as calendar entries or files between devices 
and is likely to include access to the internet through a mobile phone. 
Research uses of PANs and think about what data you would find 
useful to share between your different personal computing devices. 


Protocols 


In relation to computer systems and networks a protocol is a set of rules 
that determine how communications between devices are formatted and 
how these communications will be sent/received. 


Without protocols different computers and other hardware wouldn't be able 
to communicate with each other because they would essentially be speaking 
different languages. 

A protocol might contain details of: 

e how each computer will be identified (its address); 


¢ what route the data will take to get to its destination (routing 
information); 


¢ how errors will be detected and dealt with (error checking); 


* whether each part of a message should be acknowledged as received 
correctly; 


° what to do if data isn’t received correctly; 
¢ how the data is to be formatted; 


* how the data is to be sequenced (i.e. does it need to be sent in order or 
can it be put in its correct order later?); 


¢ how the speed of the sender and receiver can be synchronised. 
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In the 1980s, manufacturers of home computers each developed their 
own set of protocols, rather than all using a standard set of protocols. 
This meant that if you bought one manufacturer's computer, you 
would have to buy a printer especially designed for that system. If you 
later bought another manufacturer's computer you would also have to 
change your printer and most other hardware. 


There are many different protocols for different purposes. The main ones 
in use relating to networks and the internet are detailed below. 


Email protocols 


Emails are sent and received using a set of standard protocols. This means 
that when you send an email it doesn’t matter what email provider the 
recipient uses or what type of computer system they have. 


There are three main email protocols in use. 


POP3 


IMAP 


Simple Mail Transfer Protocol. This protocol is used when sending email 
through the internet. It details the format that messages are sent in, 
what commands email servers should understand and how they should 
respond to them. 


Post Office Protocol, Version 3. POP3 is the current version of the Post Office 
Protocol that is used for retrieving email from an email server. Normally 
email clients using POP3 will connect to the mail server, download any 
messages and then delete the messages from the server. 

Internet Message Access Protocol. IMAP allows emails to be accessed 
using multiple email clients. For example, you might access your email 
using an email client on your computer, your tablet and your mobile 
phone. IMAP leaves messages on the server until you delete them. This 
means that no matter which email client you use, you should see an up- 
to-date list of your email messages. 


Network protocols 


Ethernet 


Ethernet is a family of protocols that are used in wired LANs. They cover 
everything from the physical parts of a network, such as type of cable or 
optical fibre and type of connector to be used, to the logical parts, such as 
how data is sent, checked for errors and the speed that data can 

be transmitted. 


Wi-Fi 


Wi-Fi is a digital communications protocol that sets out how data is 


transmitted on wireless LANs. Wi-Fi is a trademarked term that is owned by 
Wi-Fi Alliance. 
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TEP 


The Transmission Control Protocol provides a reliable connection between 
computers. In this context ‘reliable’ means that the receiving computer can be 
certain that it has received all the data it should have (none of it is missing) 
and the data received is identical to the data sent (the data is correct) 
TCP does this by: 


* specifying that the receiving computer sends acknowledgements that 
each section of the data sent has been received; 


* using checksums to ensure that the data received is accurate; 


* allowing the receiving computer to tell the sending computer to slow 
down transmission, so the receiving computer has time to process the 
received data (this is called flow control); 


* ensuring that data sent up to the application layer (see below) contains 
no duplicates and is in the correct order, 


TCP is used when you access web pages, send/receive email or upload/ 
download files. 


TCP/IP 


TCP/IP stands for Transmission Control Protocol/Internet Protocol. TCP/IP 
iS a protocol stack, which means that it is a collection of protocols that 
work together. It is named after the two most important protocols used 
in the stack. 


These protocols work in a hierarchical set of layers, where each layer deals 
with a particular function of the network. Each layer will pass information up 
and down the protocol stack as it is processed. 


The TCP/IP protocol stack has four layers. Data passes down the stack when 
sending, and back up when receiving. 


Application This is the top layer of the stack. It is the layer which 
interacts with the user to provide access to services and 
data that is sent/received over a network. Examples of 
protocols that work at this layer are HTTP, FTP and email 


protocols. 

Transport This layer manages end-to-end communication over a 
network. There are two main protocols that operate at this 
layer - TCP and UDP. (You only need to know about TCP for 
GCSE Computer Science.) 

Internet This layer deals with sending data across multiple 
networks (possibly the internet), from the source network 
to the destination network. This is known as routing and 
is the role of the Internet Protocol (IP). 


Link This layer controls the transmission and reception of 
packets of data to/from a local network. 
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Checksum: an error detection 
technique. A mathematical 

_ formula is applied to the data and 
the resulting numerical value is 
transmitted with the data. The 
recipient computer applies the 
same formula to the received 
data and then compares the 
checksum sent with the data to 
the calculated checksum. If the 
checksums don’t match the data 
is likely to have been corrupted 
and the recipient computer 
requests the data again. 


You will sometimes see TCP/ 

IP described as protocol suite 
rather than a protocol stack. 
These two terms are often used 
interchangeably, but some 
sources State that the suite is 
the definition of the protocols 
and that stack is the software 
implementation of the protocols. 


Remembering a list of items 
where the order is important 
(such as the TCP/IP layers) can 
be difficult. Why not try using a 
mnemonic like ‘Another Truck 
Is Late’, where the initial letter 
of each word matches the initial 
letter of the layer name? 


| Packet: a small quantity of data 


- being sent through a network. The 


packet is labelled with the sender's 
address (source), the recipient's 
address (destination), how many 
packets are being transmitted and 
the position of this packet in the 
complete message. 


Web server: powerful computer 


| systems that store web pages and 


any multimedia that the pages 


might contain. 


Extend your knowledge f&\ 


Another protocol that plays 

a major part in the TCP/IP 
protocol suite is called UDP. 
Research this and create a 
table explaining the differences 
between TCP and UDP. 


Data sent using TCP/IP is broken up into packets. Each packet of data consists 
of a small section of the data being sent along with a packet header. 


A packet header contains details of: 

¢ the sending computer; 

¢ the recipient computer; 

¢ how many packets the data has been split into; 
¢ the number of this particular packet. 


Once the recipient computer receives all the packets, it will use the 
information in the header to reconstruct the data into its original format. 


A network that moves data in packets is called a packet switched network. 
(More details of how this works can be found in section 5.3.) 


Computer A sends data Computer B receives data 


Figure 5.6 Layers within the TCP/IP suite pass data down the stack when sending and back 
up the stack when receiving 


HTTP 


The HyperText Transfer Protocol is used when sending and receiving data 
between web browsers and web servers. The HTTP protocol covers how 
data should be formatted, what commands the web server and web browser 
should understand and how they should react to each command. 


For example, if you request a web page that doesn’t exist on the web server 
then the web server will send your web browser a 404 error message and your 
web browser will display this error and a short description of the error, 
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AiFPS 


The HyperText Transfer Protocol Secure is the secure version of HTTP. This 
means that the data sent between your web browser and the web server is 
encrypted, which should prevent the data being sent/received from being 
read by a third party. HTTPS also helps the web browser and user to know that 
they are communicating with the intended web server rather than a fake that 
is trying to steal sensitive data, such as passwords or bank details. 


FTP 


File Transfer Protocol is used to transfer files over a network that uses the TCP 
protocol (see above), such as the internet. 


FTP is often used when sending web pages and other associated files that 
have been created on a web developer's computer to the web server. Once the 
files are on the web server, other computers connected to the internet will be 
able to view them. 


Internet protocols 


IP 
At the internet layer, the Internet Protocol (IP) deals with: 


¢ the addressing system to identify individual computers/servers on the 
network (usually the internet); 


¢ splitting data into packets and adding the packet header with details 
such as the sender/receiver addresses. 


Each device linked directly to the internet has a unique IP address assigned 
to it to allow data to be routed to it. There are two versions of the Internet 
Protocol (IP) in use: the earlier version is called IPv4 (Internet Protocol 
version 4) and the newer version is called IPv6. IPv6 has a number of 
improvements compared with IPv4. One of the major ones is that it has 

a larger address space than |Pv4. Address space refers to the number of 
addresses available to assign to devices. 


Extend your knowledge h 


We are rapidly running out of IPv4 addresses, which is why !Pvé6 is 

being introduced. 

IPv4 has 4,294,967,296 addresses, whereas IPv6 has 

340,282,366,920,938,463,463,374,607,431,768,211,456 addresses. 

The format of IP addresses differs between the versions. 

° An |Pv4 address consists of four sets of decimal numbers separated 
by a dot. Each decimal number can range from 0 to 255. A typical IPv4 
address looks like this - 192.168.182.42 

* An IPv6 address consists of eight blocks, separated by a colon. Each 


block is a 16-bit hexadecimal number (leading zeros can be removed). 
A typical IPv6 address looks like this — fd2a:1450:4013:c01:0:0:0:5e 


Source: httes://www.arin.net/knowledge/ipv4_ipv6.pdf 
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When you use online banking 
your data will be transferred 
using HTTPS. Research why 
HTTPS is used rather than HTTP. 
What advantages does HTTPS 
give the bank customer and 
the bank itself? Create a poster 
summarising this information. 


Create a revision sheet that includes the following: 


1 adefinition of what a protocol is; 
2 a table that includes the following protocols: SMTP, POP3, IMAP, 
Ethernet, Wi-Fi, HTTP, HTTPS and FTP along with a brief description of 
what each one is used for; 
3 a diagram showing the TCP/IP protocol layers — you should annotate 
this to show what each layer does and which protocols run at each 
| layer. 


* A network is a number of computers and other devices connected together to allow them to communicate 
and share data. 


¢ ALAN covers a relatively small area. A WAN connects LANs together and spans a large geographical area. 


* Client-server and peer-to-peer networks are two different network configurations. In a client-server network 
a central computer fulfils service requests from client computers, whereas in a peer-to-peer network each 
computer can act as a client and a server. 


e Network devices can communicate using wired or wireless connections. 


¢ A protocol is a set of rules governing communications on a network. They specify the format for 
communications and how they must be sent/received. 


¢ There are many different protocols, each serving a different purpose. 
¢ A network topology is how the connections between devices in a network are configured. 


SOS ST rer evcanuacnase omit sc Oe : 
$1 What is the difference between a LAN and a WAN? 
S2 What are the advantages of wireless networks compared with wired networks? 


$3 What are the three major protocols used in sending and receiving email? 

S4 What is a protocol stack? 

S5 From memory, draw a diagram showing the bus, ring and star topologies. 

apeircees EL ana an 

C1 Why might a chain of supermarkets use a WAN to connect all its stores? 

C2 Explain the advantages of a peer-to-peer network in terms of privacy when sharing files. 


C3 Most networks in use in schools and offices today are based on the star topology. Why do you think this 
topology is used? 


C4 IMAP is now used much more than POP3. Can you explain the link between this and the popularity of 
smartphones? 


How confident do you feel about your answers to these questions? If you're not sure 
you answered them well, try drawing a concept map showing how the information 
in this section links together. Your concept map can have brief definitions on it and 
some people use colours/pictures to help them memorise the contents. 
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5.2 Network security 


Learning outcomes _ 
By the end of this section you should be able to: 

* describe why network security is important. 

* explain network-related validation and authentication techniques. 

¢ describe security issues associated with cloud storage. 


* describe common types of cyberattack and how to protect software systems from them. 
¢ describe how network vulnerabilities are identified. 


Network security and its importance ‘Keyterm 


Network security covers a wide range of activities that protect data from ON wont 7 ance 
threats to its confidentiality, correctness (integrity) and availability. |= Me et 
| designed to protect a network 


Confidentiality — An organisation’s computer system often holds data and its data from threats such as 
about its people (employees and customers), products or financial viruses, hacker attacks, denial of 
performance. It could not run efficiently, or sometimes at all, without a | service attacks, data interception 
network and this data. It is important (and in the case of personal data, the and theft, and equipment failure. 


law) that the network is not hacked so data is not intercepted or stolen by 
criminals or competitors who might use the data for illegal purposes or 
business advantage. 


Some ways to protect data confidentiality include: 

¢ ensuring only authorised users can access the parts of a network and its 
resources that they have a reason to require, such as its storage (data), 
printers or internet connection. For example, the managing director of 
a company is able to access the financial records, whereas a checkout 
employee is not (see also in the later section ‘Other ways to secure a 
network’). 


¢ stopping misuse — even users who have been given permission to access 
a network might deliberately or accidentally access data they have not 
been given permission to read. 


* encrypting data — if a criminal or other unauthorised person gains access 
to data and it is encrypted then they won't be able to read it without the 


encryption key. 


Correctness — Data is useless unless it is correct. Use of the network to 
communicate and store data must not change the data or allow the data to 
be changed without authorisation. Imagine the seriousness of an error on 
your health records or in a manufacturing control system. 


Availability — A network is useless if data cannot be accessed when it is 
needed. Complex systems are likely to fail at some time (e.g. a piece of 

hardware might stop working and have to be replaced, or program code 
might become corrupted), but it is also important that protection is put 
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Denial of service (DoS): 

an attack on a network that 
attempts to prevent legitimate 
users from accessing its services. 


in place to prevent failures caused by criminals. Virus or denial of service 
(DoS) attacks can, for example: 


* slow down network performance or stop it working altogether, 


¢ delete data; 
* allow data to be stolen or eavesdropped on; 


* alter data or program code. 


Network security importance 


Here are some more specific examples of the importance of network 
security. The data stored on the network could be: 


* required for the running of the organisation — The business might 
have details of its customers, stock and outstanding orders saved on 
the network. If the data were lost and the business failed to fulfil any 
outstanding orders, this could mean losing the trust of customers who 
then go elsewhere. This could lead to the business going bankrupt. This 
has happened to a number of businesses, some of them very large. 


Even a school would struggle to run effectively without its network and 
the data stored on it. Data, such as contact details, registers, timetables, 
medical conditions, handouts for learners, presentations to be shown by 
tutors, are stored on the network and could become inaccessible if the 
network fails for some reason. 


* private and confidential — There are many types of private and 
confidential data that people or businesses wouldn't want to make public. 
For example, a hospital stores patient details on its network, including 
medical conditions and what drugs have been prescribed. There is a legal 
requirement to keep personal data secure. If the network isn’t secure and 
the confidential data is leaked, then the hospital could be sued. Many 
potential patients would be put off using the hospital due to a lack of 
trust in the ability of the hospital to maintain patient confidentiality, 
which could result in the hospital being closed down. 


¢ financially valuable — Many types of data stored on business networks 
might be financially valuable. For example, imagine a business was 
planning a huge sale to increase its revenue and attract new customers. 
If a competitor managed to get hold of the details of the planned 
price reductions before the sale started, it could launch its own sale 
beforehand, undercutting the business. This tactic would reduce the 
chance of the sale increasing revenue or attracting new customers. 


Articles relating to network security can often be found on news websites 
under the technology section. Reading these articles will broaden your 
knowledge about the latest security threats. You can then look for further 
details on specialist information technology websites, where you will 


often find details of what happened and how to protect against that kind 
of threat. 
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Authentication and validation 


Authentication is the process of checking the identity of a user of a 
computer system or network. This is often done by validating a username 
and password against details stored on a central server, but other systems 
of authentication exist, such as using a PIN (personal identification number) 
or fingerprint recognition. If the details you provide match those on record, 
then you are allowed access to the network and its resources: 


Extend your knowledge > 


Passwords have many weaknesses, for example people write them down 

or use guessable passwords (like a pet’s name). Programs exist that 
repetitively try guessing passwords. To improve the security that passwords 
provide, an additional check, such as having to type in the code froma 
portable hardware device called a ‘secure token’ or from an SMS message 
sent to your mobile phone, is being introduced by some businesses. This 

is called two-factor authentication. How does this enhance security? Try 
researching some different two-factor authentication systems and think 
about which ones are likely to be the most secure. 


Think about how you would write a program so that only approved users 
who enter their password correctly can get beyond the log-in screen. 


1 Write the pseudo-code for the username and password checking part 
of the program. 
Write your program. 


3 Discuss how secure you think your implementation is. How could it 
be improved? 


Other ways to secure a network 

There are many different ways to help secure a network, authentication 
alone is not enough. Three others you need to know about are: access 
control, firewalls and physical security. 


Access control 
Once a user has been authenticated they gain access to the network; however, 
you wouldn't want every user on a network to have access to every file. 


' Access control: this determines 
which users have access to which 


data, and what they are allowed 
* read-only access — in this case the user can open the file and read its gore aithat 


contents, but not modify the contents or delete the file; 
* read and write access (modify access) ~ in this case the user can read the 
file, alter the contents and then save the changes. 


Access control is the method that controls whether a particular user will 
gain access to a particular file. Access control also decides if that user gets: 
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A system administrator (often 
shortened to sysadmin) is a 
person whose job is to manage 
and maintain the network 
system. This job includes 
ensuring the network is working 
correctly, setting up users, 
dealing with security issues, 
installing software and many 
other tasks. 


| Hacking: the act of gaining 


| unauthorised access to a 
computer system and the data it 
contains. 


Physical security: controlling 
access to critical parts of a 

* network using physical methods 
(such as locked doors) rather 
than software. 


i 


In some network systems you might need further permission if you want to 
delete a file — this is sometimes referred to as full control of the file. 


Controlling the level of access a user has to a file is useful for many reasons. 
For example, you might want employees of a shop to be able to open a price 
list, but not accidentally or deliberately modify any prices. 


Access controls are set up by an organisation's system administrators 
according to the management's requirements. 


Firewall 

A firewall is a network security system that monitors and controls data that 
is moving from one network to another. Often one of these networks is the 
internet, in which case the firewall sits between the local internal network 
and the internet. 


A firewall inspects incoming and outgoing data and uses a set of rules 

(often known as a firewall policy) to decide whether to allow or disallow 

the data to move from one network to another. The rules in a firewall are 
designed to secure the internal network from a range of potential threats. 
The organisation that owns the firewall can customise the rules so that the 
firewall is suitable for their particular circumstances. Some examples of what 
the rules can do are: 


¢ Stop certain protocols from being used (for example FTP) to stop the 
organisation's data from being potentially copied to an external server. 


* Block data coming from or going to certain network addresses. The 
blocked addresses could be of particular computers, servers or websites, 
or even a range of network addresses that belong to particular countries 
or organisations. 


¢ Stop attempts at hacking the internal network’s servers by disallowing 
data that matches the pattern an attacker would use. 


Firewalls can be software or hardware based. Individual computers (such 
as your home computer or laptop) are likely to have a software firewall 
installed with some default rules to protect your computer from common 
threats. A business with a LAN and internet connection is likely to have a 
hardware-based firewall because these have much more flexibility in terms 
of the rules that can be applied and allow faster throughput of data. 


Physical security 


Physical security ensures that critical parts of the network can only be 
physically accessed by authorised people, such as the network technicians ora 
systems administrator. It also includes protecting against theft of equipment 


and might involve installing a burglar alarm, security tagging and physically 
locking down equipment, etc. 


The starting point for physically securing a network is having the servers 
located in a locked room, normally referred to as the server room. Electronic 
lock systems have many advantages over traditional lock-and-key systems as 
entry/exit times can be recorded and individual cards/fobs can be deactivated 
if lost or if an employee leaves the company. 
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Physical security is very important because anyone with physical access to 
the server can more easily bypass the security provided by the authentication 
system or access control system. Having done so they could copy, modify or 
delete any data on the network, or install malware that would allow them to 
gain remote access whenever they wanted. 


| A business has a network comprising 50 computers, several printers and 
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an internet connection. The business 
* employs two technicians to manage the network: 


* has several departments, including sales, accounts and human 
resources; 


* is run by a managing director, with a manager in charge of each 
department and around 60 other employees. 


1 Create a poster explaining to the employees what authentication is 
and what makes a secure password. 

2 Where should the servers in this business be located? Justify your 
answer and explain who should be able to get physical access to 
the servers. 


Cloud storage and security 


The concept of the ‘cloud’ is discussed in Chapter 4, section 4.2 on page 151. 
This section covers the security issues associated with cloud storage. 


Many of the advantages to cloud storage relate to securing the availability 
of data. 


The cloud storage provider is responsible for the hardware your data is 
stored on. They will need to ensure that the data is accessible and that they 
maintain the hardware and software needed to make the data available. 


The amount of storage available to an organisation can easily be changed 
as and when required — the cloud storage provider can normally make 
extra storage available in minutes. Doing this physicaily would mean 
ordering more hard drives and employing technicians to install them and 
set them up. This would take considerably longer than a few minutes. 
Having data stored off-site (not on the organisation's premises) means 
that it is protected from loss due to fire, theft of computers/servers, 
electrical failure, and so on. As the data is still available the business 
may be carried on from elsewhere. This could be the difference between 
staying in business or going bankrupt. 

Many cloud storage systems also manage the back-up of your data. They 
may take hourly or daily snapshots of your data and be able to restore 
from these in the case of hardware failure or accidental deletion of files. 
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Malware: short for ‘malicious 

| software’. It is used as a generic 

_ term for any kind of software that 
is designed to disrupt the use of 
a computer system. 


Discuss with a classmate the 


advantages and disadvantages of 
moving your school’s files/data 
to cloud storage. Focus on the 

| security and privacy issue that 
the school will need to address. 


However, cloud storage also has a number of security disadvantages. 


You are relying on a third-party storage provider to keep your organisation 
running. What happens if the cloud storage provider goes bankrupt, has a 
catastrophic failure or has its servers stolen? In any of these cases it is likely 
you would be unable to access your data, at least temporarily, but possibly 
permanently. 


Data stored anywhere accessible via the internet carries the risk of other 
people gaining access to it. This might happen through a deliberate attack 
by hackers, or as a result of an accidental configuration error that results in 
the data becoming publicly accessible. Also the internet isn’t a particularly 
secure network and data may be accessed as it is being sent to/from the 
cloud storage provider. The loss of data, particularly of a confidential 
nature, could be devastating for an organisation. 


Users of cloud storage have to assume that the people providing the 
service are trustworthy and that their data is being held safely and securely. 


The cloud storage provider might have its servers located in a different 
country (or countries) to your organisation. Some types of data can only 
legally be stored in countries that have similar data protection laws to 
our own. 


Access to cloud storage is dependent on having a reliable, high-speed 
internet connection available. How will a business operate if the internet 
becomes unavailable for some reason? 


The decision to use cloud storage involves carefully weighing up the 
advantages against the disadvantages. Many of the disadvantages present 
risks to the operations of an organisation, but the impact of some of these 
risks can be reduced. For example: 


Many cloud storage providers will store your data at multiple locations so 
that fire, flood, hardware or electrical failure should have no major impact 
on accessing your data. 


You can encrypt your data so that it is much more difficult to read while it 
is either being transferred to/from the cloud storage provider or stored on 
the provider's servers. 


Contemporary storage and security 


The types of storage systems available are constantly evolving. One recently 
developed storage system is network-attached storage (NAS). 


NAS is a hardware device that is connected to a network to provide file 
storage for any device connected to that network. A typical NAS device 
designed for home use could consist of a single hard drive and associated 
network hardware, whereas an organisation's NAS device might consist of 
many hard drives and the associated network hardware. 
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NAS devices often include a wide range of additional features, such as 


allowing access over the internet, and specialist apps to allow smartphones 
and tablets to access the files stored on the NAS easily. 


Often NAS devices are designed for ease of use rather than being secure. 
Once a NAS is connected to the internet it becomes possible for it to be 
hacked remotely. Home users often make mistakes such as: 


¢ Not changing a device's default password, thus giving access to the stored 
files to anyone who looks up the default password on the internet. 


* Not updating the software running on the NAS. Manufacturers tend to release 
updated software to fix security weaknesses as they are found. If the software 


isn't kept up to date, these weaknesses might be used to gain access to the 
stored data. The security weaknesses are often published on the internet for 
criminals to read. 


There are ways to minimise the security risk to data when using a NAS or similar 


device. Firstly, decide if the data does indeed have to be accessible over the 
internet and if NAS is the correct storage device for the data. If the data needs 
to be available over the internet, then ensure you use a complex password, 
remove any default passwords and ensure that all software patches (updates) 
are applied. 


Another widely used storage device is the USB flash drive. USB flash drives are 
easy to transport, relatively cheap for the amount of storage available and very 
convenient to use. 


The downside is that people can carry large amounts of possibly sensitive 
information around with them on a small drive that is easily lost. Anyone can 
find a lost flash drive and access the information. To overcome this, encrypted 
USB flash drives are available. These are unreadable until a password or 
numeric code is entered on to a very small keypad on the body of the drive 
itself. (See Chapter 3, section 3.4, page 125 for more on encryption.) 


USB storage devices also present an ideal opportunity for an employee to steal 
data, perhaps to sell to somebody else. 


Many providers offer cloud storage aimed at individuals. 

1 Create a table to compare what three of these providers offer. 
Headings for the table should include: Amount of free storage space, 
Cost for more storage, Security measures. 

2 Research and write a brief report that explains what commitments 
these providers make to keep your data secure. Relate this to the 

| security measures you found out about in step 1 of this activity, 
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USB: Universal Serial Bus 
. sockets found on most modern 
, computer systems. 


Cyberattack: any kind of 
| malicious attack on a network- 
connected device. 


Social engineering: any kind of 
attack on a computer system or 
network that exploits how people 
behave and respond to certain 
situations. 

Phishing: attempting to get 
confidential information by 
sending a user a message 

that appears to come from 

a trustworthy person or 
organisation. 


Shoulder surfing: gaining access 
to information by watching 


| 


someone enter it into a computer 


system. 


Cyberattacks 
A cyberattack is any kind of electronic attack on a computer system, server, 
network or other IT device. These attacks might be designed to: 


* gain access to data contained within the system; 
« delete or modify information; 
* make the system unavailable for use; 


¢ physically damage a device connected to the network (usually by overriding 
safety limits). 


Types of cyberattack 

There are many different forms of cyberattack and these are constantly 
changing as technology evolves. Most cyberattacks can be classed as 
exploiting either human behaviour or technical weaknesses. 


Social engineering 

Attacks that rely on exploiting human behaviour are often referred to as social 
engineering because the attacker will ‘engineer’ a situation where the target 
person (or group of people) gives away confidential information or gives access 
to the confidential information. 


Two common forms of social engineering are phishing and shoulder surfing. 


Phishing 

A phishing attack is an attempt to get sensitive, confidential information from 
the user of a computer system or service. Often the information the phishing 
attack is targeting is usernames and passwords or financial details such as bank 
account or credit card details. 


There are various forms of phishing. The most common types are through email 
or fake websites that look legitimate, but phishing can also happen via phone 
calls or instant messaging. 


A typical phishing attack might start with an email that asks the user to update 
details at a bank, online payment system, online auction website or social 
network. Sometimes the bogus reason given for this request is that there has 
been a fraud attempt on the user’s account. 


Once the user clicks on a link within the email, a website that looks and acts like 
the real website will open, but when the user enters his or her login details they 
are passed to the attacker, who will use them for financial gain. 


Phishing takes advantage of people trusting the contents of emails to be 
genuine. Many people don’t bother to look at the web address they have been 
taken to or think about whether their bank is really going to email them to 
request a security update. 


Shoulder surfing 


shoulder surfing means gaining access to confidential information by directly 
observing a user, possibly literally looking over their shoulder, as they complete a 
task. Often shoulder surfing is used to get a person’s username/password or PIN. 
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Often shoulder surfing happens in busy places where the target is 
concentrating on completing a task quickly. A classic example is at a cash 
machine on a street, where the shoulder surfer stands near the cash machine 
and sees the user enter his or her PIN on the keypad. Once the PIN has been 
seen the card is stolen. The card and PIN can then be used to withdraw cash. 


| 1 Research recent cyberattacks. What kinds of organisations are targeted? 
How successful were the attacks? What information was taken? 


2 Some organisations have been known to deny that a cyberattack 


successfully happened or downplay the effects of a cyberattack. Why 
might they do this? What are the risks to a business of lying about 


cyberattacks? 


Technical weaknesses 


Other forms of cyberattack rely on technical weaknesses (vulnerabilities) in the 
system being attacked. Some common examples ar< described below. 


Unpatched software 

Software is very complicated and usually security issues are found as it is used 
in the real world. The maker of the software will normally provide updates 
(referred to as patches) to fix security issues as they are found. 


Often these security issues are discussed on the internet and some people will 
use this knowledge to attack unpatched software to gain unauthorised access 
to information. 


The patches to fix the security issues often have to be manually installed by 

a technician. Sometimes these patches get forgotten about or the organisation 
doesn’t have any technicians to install the patch, so the software remains 
vulnerable. 


USB devices 

You read earlier about the weaknesses associated with USB flash drives, but 
any USB device can potentially be a security threat because it might contain 
malware that could be transferred to your system or copy data to the attacker 
via the internet. 


Eavesdropping 

One form of attack mentioned earlier is an eavesdropping attack. In 
computer security terminology, eavesdropping means intercepting data 
being sent to/from another computer system. In a similar way that a person 
can eavesdrop on a conversation without the speakers knowing about it, 
eavesdropping on a network is simply reading data without actually copying 
or stealing it. The owner of the data might not know that the data has been 
read until it is used by the criminal. 


Security weaknesses such as unpatched software or a USB device might allow 
malware to be installed on the network that allows an eavesdropping attack to 
be carried out, but specialised hardware might also be used. 
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Unpatched software: software 
that hasn’t had the latest security 
updates applied to it, making it 
vulnerable to attack. 


Code vulnerability: a computer 
program (the code) that has 
been written in such a way that it 
creates a security issue that may 
be taken advantage of to gain 
access to the computer system or 
data within it. 


Protecting against security weaknesses 

There are many ways to strengthen computer systems and networks from 
attacks, starting from when software is designed and written, to deciding 
on what kind of operating system should be used and how network security 
provision will be implemented. 


Design and implementation of software 

Security must be considered at the system and software design stage if a piece 
of software is to be as resistant as possible to cyberattack. It should not be an 
afterthought when the system has been implemented. 


The software and system designers will need to consider the following, 
for example: 


¢ What kind of authentication is needed? 


© Dodifferent users need different levels of access to the information stored 
within the system? 


* Should warnings be issued before allowing users to carry out potentially 
risky operations such as copying or deleting large amounts of information? 


¢ Does stored data need to be encrypted? 


¢ What threats will the software face? Will the software be run on computers 
that are connected to the internet? How do they guard against phishing, 
malware and other hacking attempts? 


Once the designers have considered security, they need to look at the 
implementation (writing the code) phase. 


One major issue at this point is guarding against bad programming practice. 
Usually a software project will have many different programmers working 
on the code. Some programmers write poor quality code and don’t consider 
how safe or secure their code is, and sometimes time pressure might make 
even a good programmer take shortcuts to meet deadlines. 


Another major issue is that some methods used in programming might lead 
to code vulnerabilities. A code vulnerability is when the code does the 

task intended, but has been written in such a way that it creates a potential 
security issue. 


More details about the kinds of issues and their impact that need to be 
considered when creating software can be found in Chapter 6 page 216. 


Issues like these can be minimised by having regular code reviews. There 
are two main types of code review. 


* Review by another programmer, usually someone who is more senior and 
has more experience of writing secure code. The reviewer will look at the 
code produced, checking to see if any bad programming practices have been 


used or if any code vulnerabilities are present. This is fairly labour intensive 
and therefore expensive. 
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An automated review. Here, a specialist piece of software is used to examine 
the code. The software will highlight potential issues such as common 
vulnerabilities in the programming language or obvious bad programming 
practice. This software can't find every issue and is also fairly expensive. 


sometimes a combination of both types will be used. If issues are found, then 
the original programmer will be asked to improve the code and another code 


review will be scheduled. Keytems 


Modular testing is important to security because if small problems remain Modular testing: testing each 
they might be used by hackers to gain access to the system or the data it block of code as it is completed 
contains. For example, failing to validate input correctly could lead to a hacker to ensure the code works as 
being able to crash the program and then gain access to the underlying expected. 


system, which could allow valuable data to be accessed. 


Audit trail: a record of activities 


Other security measures to protect from cyberattacks (dt wave Taken plage 


Here are some other methods to reduce the chance of cyberattacks succeeding. ae eal mS dase . 
generated automatically and will 


¢ Use an audit trail. An audit trail is a record of activities that have taken record what has happened and 
place on a computer system, and which cannot be changed. The audit trail who or what made the change. 
is automatically generated and likely to be in chronological order. Ordinary 
users of a system shouldn't be able to read an audit trail. The amount of 
detail in an audit trail varies but is likely to contain date and time of change, 
what change happened, who or what (as in a user or other software that is 
part of the system) made the change. Audit trails allow technicians to figure 
out what happened during a cyberattack, if the attack was successful, what 
sections of the system were accessed and if data was copied or modified. 


e Use secure operating systems. Different operating systems are designed for 
different purposes. Some are designed with security in mind and these are 
likely to be much harder to attack successfully due to the way they have been 
written and the features they offer. When implementing a computer system 
that will contain sensitive or confidential data the choice of operating system 
must be considered carefully. 


¢ Provide effective network security. To keep a network secure requires 
effective management, monitoring and training of its users. Well-educated 
technical staff, with up-to-date training, are needed to keep software patched 
correctly, implement policies that reduce the chance of an attack being 
successful, train users in best practice in terms of security and monitor the 
systems to ensure there is no unauthorised access. 


Identifying vulnerabilities 


Ethical hacking is the branch of computer science that relates to cybersecurity Activity 12 ? 
and preventing cyberattacks from being successful. Ethical hacking is , 
essentially ‘good’ hacking - it is looking for weaknesses in software and A patch has become available for 
systems so that they can be improved — whereas hacking is seen as ‘bad’ - a major web browser. Research 


the risks that the user faces by 
not updating his/her software 
_ and add details of the potential 
risks to your revision notes. 


trying to gain access to a system to steal data or cause damage. 

There are many ways that vulnerabilities can be identified and some of 
these are explained below. Once a vulnerability has been found, steps to 
remove or reduce its impact can be taken. 
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Penetration testing 

Penetration testing (often shortened to ‘pen testing’) is where the IT systems 
of an organisation are deliberately attacked to find any weaknesses. These 
attacks are authorised by the organisation and are therefore legal. The attacks 
might be run by employees of the organisation or by a business that has been 
contracted to run the tests. 


The pen tester(s) is likely to try to gain access to all systems that the 
organisation has and the attacks will often include looking for technical 
weaknesses and trying social engineering methods. 


Once the pen testing has been completed, a report is usually presented to a 
senior manager within the organisation explaining what issues were found and 
the likely impact of them. 


Commercial analysis tools 

It is possible to use software tools to scan a system for vulnerabilities. These 
commercial analysis tools or vulnerability scanners can be either purchased or 
hired. The tools look for common issues and alert the user to them. 


These tools can only identify already-known vulnerabilities and must be kept 
up to date to be effective. They can be used to scan the network from within 
(internally) or from outside (externally). Both of these scans can be valuable. An 
internal scan can show up issues that could be exploited by a rogue employee 
or a hacker who managed to physically get into the building or gain access to 
the network in some other way. The external scan might show vulnerabilities 
that a hacker could exploit from outside the company network. 


One negative aspect is that these analysis tools aren't really restricted in terms 
of who can purchase them, so they could be used by a potential hacker to find 
security vulnerabilities. 


Reviews of network and user policies 

All networks should have written policies that document: 

¢ who is authorised to carry out various activities on the network; 
¢ how and when patches to software should be applied; 

* access controls; 


* password requirements, including how complex passwords should be and 
how often they should be changed; 


¢ how security is set up and maintained on the network: 
* what data audit trails should collect and how long they should be kept for; 


* anything else relevant to the security and maintenance of the network. 


Before an employee is given access to the network they should be givena 


copy of the network user policy to read and sign to show they accept the 
organisation’s rules on the use of the network. 
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This user policy is likely to contain details of: “Exam-style question 


* what is acceptable and unacceptable use of the network; Describe how an email phishing 
escri 


* what will happen to the user if they do something unacceptable (depending —_ attack targeting bank customers 
on the severity of what they do wrong, this could include losing their job); | might work. (4 marks) 


¢ how to report faults, problems and security issues; 


* security information, such as good practice when choosing and using Examtip 


passwords. 
¢ This question tests your 
Over time the hardware, software and usage of a network are likely to change. understanding of what a 
Any policies relating to the network should be regularly reviewed to ensure that phishing attack is and how one 
they are up to date and suitable to help maintain network security. might work. 


* A good answer should state 

the chain of events that might 
Find your school’s Network User Policy (you might need to ask your lead to someone entering their 
tutor for a copy). bank logon details into a fake 
bank website. 


1 Review the policy. Has it been updated recently? Is it easy to 
understand? Is it detailed enough? 


2 Using your school’s policy and any others you can find on the 
internet, try to redraft the policy to make it easy to understand for 
non-specialist users of IT. 


e Authentication is the process of checking the identity of someone trying to use a computer system. 


¢ Cloud storage is storing data using a third party, usually in a system connected to the internet. 
¢ Acyberattack is any kind of electronic attack on a computer system, server, network or other IT device. 
¢ There are many different types of cyberattack; most exploit technical weaknesses or human behaviour. 


* Many different measures need to be taken to prevent cyberattacks from being successful. These start at the 
design stage of the system and include keeping systems up to date, training staff and proactively testing the 
security of the network regularly. 


S1 What is the most common authentication method in use? 
S2 What is access control? 
$3 What is social engineering? 


C1 In your own words, summarise the different types of cyberattack. 
C2 The number of cyberattacks happening is increasing each year. Why do you think this is the case? 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, try writing a list of questions then work 
through this section adding relevant points as you find them. 
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5.3 The internet and the World Wide Web 


By the end of this section you should be able to: 
¢ describe what the internet is and how it is structured. 
¢ describe what the World Wide Web (WWW) is and the meaning of related terminology. 


Internet Service Provider (ISP): 
an organisation that provides its 
customers with a connection to 
the internet. 


Router: a piece of networking 
hardware used to forward 
packets of data from one network 
to another. 


The internet 


The word internet is a shortened form of the words ‘inter’ and ‘network’, 
which together means interconnected networks. It’s a good idea to think 
of the internet as a network of networks (i.e. the biggest WAN of all). Your 
school, many organisations and many people's homes all have a network 
that is a part of the internet. 


The reach of the internet is global — all countries have multiple cables, fibre- 
optic links and satellites that connect it to other countries. These links plus 
others within each country can be thought of as the internet backbone - the 
connections that link all the networks together. 


How the internet works 

Your computer or network is likely to be connected to the internet using an 
Internet Service Provider (ISP). An ISP is an organisation that provides 
internet connections. This connection to the internet can be provided ina 
number of ways, but the most common is through telephone lines. 


The internet uses the TCP/IP protocol stack to allow communication between 
all the different networks. More detail about protocols can be found in 
Section 5.1, page ios. 


The Internet Protocol (IP) part of the stack provides the technical rules on 
addressing; that is, it provides each device or network connected to the 
internet with a unique address to send data to/from. 


As discussed in section 5.1, two versions of IP. Internet Protocol version 4 
(IPv4) and Internet Protocol version 6 (IPvé6), are currently in use. 


The networks that are a part of the internet are linked together using 
routers. A router is a piece of networking hardware that forwards packets 
between networks. A router has a routing table that is essentially a list of 
rules stating where to send packets for different destinations. When a router 
receives a packet it looks in the packet header for the destination address 
and then uses the rules within the routing table to decide where to send it. 


It is likely that a packet will need to be forwarded between several routers 
before it reaches its destination. 


When an internet-connected computer wants to send data to another whose 
IP address it already knows the following happens. 
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1 The sending computer splits the data into packets. 


Each packet has a header that contains the sender's address, the 


destination address, the current packet number and the total number 
of packets that make up the data. 


Each packet is sent to your ISP. 


Your ISP will have a router. This router will inspect the packet header, 


and decide where to send the packet depending on its destination 
IP address. 


4 The packet is likely to end up at another router, which will! again look 
at the destination IP address and forward it on. This can happen many 
times before the packet reaches its destination network and intended 
recipient. 

5 Once a packet reaches its destination, the receiving computer will 
reassemble the data from the packets. Depending on the protocol 
being used, the packets might arrive in the wrong order and have to 


be put back in order using the information in the packet header. Activity ‘4 ? 
Find out what services other 
The internet, like any network, has many different services running on top than the World Wide Web and 
of it. Email and the World Wide Web are two of the most commonly used email use the internet. Create a 
services to which the internet provides access. list with a brief definition of each 
service, what protocols it uses 
The World Wide Web and how it uses the internet. 


The World Wide Web (often shortened to WWW or ‘the Web’) is a service that 

runs on the internet. It provides access to web pages (a type of document), ‘Didyouknow? 
which are linked together using a hypertext system. Hypertext is simply 

text that contains links (hyperlinks) that can be selected to go to another The World Wide Web was 


document/web page. These documents can contain text, graphics and invented in 1989 by a British 
—— scientist called Tim Berners- 


Lee. Can you find out when the 
Accessing the World Wide Web internet eceinventad? 
You access the World Wide Web using a program called a web browser. There 
are many different web browsers to choose from but they all use the same 
protocols and use the internet to transfer information. The web browser's 
job is to convert the data received from a web server (see page 186) toa 
human-readable format. When a user accesses a website using the client-- 
server system mentioned on page 174 in section 5.1, the web browser is the 
client and the web server provides the service to the web browser. 


How the World Wide Web works 

The World Wide Web uses the internet to transfer data from one computer 
system to another. The computer being used to access the World Wide Web 
connects to the internet using an ISP and will be running a web browser. 


Getting a web page or other file using the World Wide Web involves a large 
number of steps that use different protocols and standards. 
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HyperText Markup Language 
(HTML): essentially a text 

- document that contains any text 
to be displayed along with: 


¢ details of how the text should 
be formatted (e.g. font size, 
colour etc.); 


* details of any hyperlinks and 
where they link to using a URL; 


¢ details of any objects such as 
pictures or videos that should 
be shown within the web page 
when it is displayed. 


¢ As this is question is for 
1 mark, only one answer is 
correct. 


1 The user of a computer enters the web address of the information he or she 
wants to look at. A web address is also known as a URL (Uniform Resource 
Locator). A URL may be for a web page or an individual file, for example 
a URL could be just for a particular picture file rather than a web page. 

2 The computer uses a system called the Domain Name Service (DNS) to 
find the IP address of the required web server. 

3. The web browser connects to the web server using the IP address and 
requests the relevant web page or other object (picture, sound or 
video file). 

4 Aweb page is transferred from one computer (normally a web server) 
to another using HTTP or HTTPS, which were covered in ‘Network 
protocols’ in section 5.1. 

5 Data sent from a web server to a web browser is in HyperText Markup 
Language (HTML) format. The web browser displays the web page as 
described by the HTML. 


<!DOCTYPE html> 

<html> 

<head> 

<title>Page Title</title> 
</head> 

<body> 


chl>This is a Heading</ni> This is a Heading 


<p>This is a paragraph.</p> _ This isa paragraph. 


</body> 
</html> 


Figure 5.7 How a web browser display of a web page is generated from HTML 


Which of the following protocols would not be used when transferring a 
web page from a web server using the internet? (1 mark) 
ay GP 


ber 
c IMAP 
die 


If the user selects a hyperlink, then the URL that the link points to will be 
loaded using the method outlined above. The URL might point to a web 
page on the same web server or on a different web server. This is where the 
‘web’ part of World Wide Web comes from - different pages and web servers 
are linked together in a complex pattern. 
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<a difference between the internet and the World Wide Web. | ¢ This question tests your 
marks 


understanding of how the 
internet and World Wide Web 
are related. 


¢ A good answer should state 


Draw an annotated diagram that shows how a packet gets from one Ble ule Ue Uin erie 


computer to another through a number of routers on the internet. are and how the WWW uses 


2 The internet is capable of continuing to work even when some parts Diente, 


of the physical network fail. Research and discuss how the internet 
deals with faults of this kind. 


* The internet is a global network of networks. It is used to transfer data between different computer systems. 


¢ The internet has many different services running on top of it. wo common services are the World Wide Web 
and email. 


e The World Wide Web runs on top of the internet. Although people often mix the two terms up, they are not 
the same and as a learner studying Computer Science you need to use the terminology correctly. 


$1 Write your own definition of what the internet is. 

S2 Write a definition of what the World Wide Web is and add an explanation of how it is different from 
the internet. 

S3 Describe what happens when you enter a web address into a web browser. 


C1 Explain in your own words the role of all the protocols used when requesting a web page from a web server. 
C2 Explain why you think the internet and World Wide Web are so embedded in modern life. 


How confident do you feel about your answers to these questions? If 
you're not sure you answered them well, try finding some online videos 
that explain the concepts visually. 
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6.1 Computing and the environment 


Learning objectives 


By the end of this section you should be able to: 


* explain how the manufacture, use and disposal of computing technology damages the environment. 


* describe efforts being made to reduce the environmental damage caused by computing technology. 
* give some examples of how computing technology is helping to protect the environment. 


It's hard to imagine what life would be like without the internet, social 
media, search engines and e-commerce. Global demand for smartphones, 
tablets and other forms of computing technology, including embedded 
processors, web servers, sensors and hard drives, is growing rapidly year on 
year. At the same time, the pace of new product releases and consumers’ 
desire to own the latest model is shortening the lifespan of these devices. 
The average life expectancy of a smartphone, for example, is estimated to 
be less than two years. The more computing technology we buy, the more 
we throw away. 


Not surprisingly, the manufacture, use and disposal of computing 
technology have a significant impact on the environment, using up 
dwindling resources of non-renewable materials, creating massive piles 
of potentially harmful e-waste, consuming vast quantities of energy and 
damaging people’s health. 


Manufacture 


Manufacturing a smartphone, a PC or any other piece of computing 
technology is a complex process, starting with the extraction of raw 


materials and ending with the finished item being shipped to the customer, 


with lots of stages in between. This makes it difficult to determine 


accurately the overall environmental impact, although there’s no doubt that 


it’s considerable. 


Raw material extraction 

A number of non-renewable natural resources are used in the manufacture 
of computer products. They include sand (to make glass for screens), 

oil (used to make plastics) and various metals used in wiring and circuit 


boards. Some of the metals used, such as silver, gold, copper and palladium, 


are precious and in short supply. Others, such as arsenic, cadmium and 


chromium, are hazardous and pose a serious health risk. Radioactive metals 
used in computer products, such as uranium and thorium, can contaminate 


air, soil and groundwater, and are toxic to human health. 


In some regions of the world, mining of raw materials is poorly regulated. 
Excavation causes extensive damage to the local environment, scarring 
the landscape with unsightly holes and waste heaps, contaminating water 
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Computing technology: an 
all-encompassing term referring 
to the hardware, software and 
infrastructure that underpin 
current and emerging computer 
systems. 


e-waste: any form of discarded 
electronic equipment, including 
, computing technology. 


Try to stay up to date with 
computer science news, so 

that you know about emerging 
technologies and are aware of 
current issues. The BBC's Click 
website is a good starting point. 
It's also worth checking out the 
technology section of the BBC 
website and subscribing to the 
cs4fn magazine. 


Extend your knowledge fi 


China is the world's largest 
producer of rare earth metals — a 
group of 17 chemical elements 
that, due to their unique 
magnetic, luminescent and 
electrochemical properties, help 
improve the performance of 
computing technology and make 
it more energy efficient. 


Extend your knowledge fw 


The United Nations University 
has estimated that the 
manufacture of a computer and 
monitor weighing 24 kilograms 
requires ten times the amount 
of fossil fuels (240 kilograms), 
approximately the same weight 
of chemicals and around 1,500 
litres of water. 


supplies and endangering wildlife habitats. Poorly equipped miners working 
in dangerous conditions run the risk of being seriously injured and are also 
susceptible to long-term respiratory illnesses, such as silicosis, bronchitis or 
lung cancer. 


Production 

Once extracted, the raw materials are shipped to factories — often thousands 
of miles away - to be manufactured into components, such as circuit boards, 
chips, screens, disk drives and cases. 


In turn, the components are dispatched onwards for assembly into finished 
products. 


The manufacture of computing technology is energy intensive. Large 
amounts of non-renewable fossil fuels, such as coal and oil, are used 
during the process. Burning fossil fuels produces carbon dioxide (COz) and 
contributes to global warming. 


Semiconductors are present in every piece of computing technology. 
Manufacturing semiconductors is highly water intensive. For example, a 
factory producing 40,000 semiconductors a month uses around 20 million 
litres of water a day —- ona par with the consumption of a city with a 
population of 60,000. This can result in water shortages in areas where 
semiconductor factories are located, and untreated wastewater discharge 
can cause environmental pollution. 


This table lists six of the most hazardous materials used in the manufacture 
of computing technology. 


Cadmium A metal used in the manufacture of rechargeable 

batteries, printer inks and toners. 

Lead ' Ametal used in the manufacture of circuit boards and 
cable sheathing. 

Mercury _Ametal used in the manufacture of LCD screens. | 

Hexavalent A chemical compound used to make casings. 

chromium 

Polychlorinated Toxic compounds added to plastics, circuit boards, and | 

biphenyls (PCBs) _ connectors to make them more fire retardant. 

Polybrominated 


diphenyl ethers 


Research indicates that exposure to these materials is harmful to human 
health, causing both physical and neurological damage. Furthermore, chemical 


emissions and wastewater from manufacturing plants put people living in the 
vicinity at risk. 
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There is growing recognition of the need to address this problem. 


The EU Restriction of Hazardous Substances (ROHS) Directive was 
transposed into UK law in 2013. It restricts the use of all six of the materials 


listed in the table above, forcing manufacturers of computing technology to 
replace them with safer materials. 


At the same time, governments are imposing tough recycling targets 
designed to ensure that more reusable material is recovered from redundant 
computing technology, and reused so that fewer raw materials are needed 
and reserves of scarce resources are protected. 


Growing public awareness is putting pressure on manufacturers to improve 
working conditions in their plants and impose stricter requirements on their 
component suppliers. 


Usage 

The amount of energy consumed in the manufacturing process pales into 
insignificance when compared with the energy required to keep mobile 
phones, computers, networks, telecommunication links, etc. up and running 
day after day. Even though each individual device doesn’t require a huge 
amount of electricity, close to two billion connected PCs and laptops and 
more than six billion mobile devices collectively do. 


in recent years considerable efforts have been made to improve the energy 
efficiency of computing devices. However, the amount of energy they 
actually consume depends on how they are used and what they are used for. 
The task that a computer is performing and the software being used are key 
determinants of the actual energy usage. High-end applications, complex 
calculations, 3-D modelling and video games are particularly power hungry. 


Cloud computing (see section 4.2, page 151) and data centres in particular 
are major energy guzzlers. Vast amounts of electricity are needed to power 
and cool all the computer equipment that is needed, putting them ahead of 
the aviation industry in terms of the energy they consume. The worst culprits 
are the small, inefficient data centres hosted by private organisations and 
government departments, which tend to be far less efficient than the large 
facilities operated by cloud providers such as Google® and Apple. 

Energy efficiency measures and the use of renewable energy can 
significantly reduce the carbon footprint of data centres. Facebook, for 
example, has built a huge data centre in northern Sweden, just 100 km 
south of the Arctic Circle. Its location was selected because of its access 

to renewable hydroelectricity and the cold climate that helps to keep the 
servers cool. 


Data centres consume large amounts of energy. 


_1_ Research what this energy is used for. 
2 Identify four measures that can be taken to make data centres more 


environmentally friendly. 
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A lot of energy is wasted while 
a computer or printer sits idle. 
Using the ‘sleep mode’ when a 
device is not in use can reduce 
consumption by more than 

50 per cent. 


Research why using a laptop 
rather than a desktop is more 
energy efficient. 


Carbon footprint: the amount of 


carbon dioxide an individual or 
organisation produces as a result 
of the energy they consume. 


Disposal 

The disposal of redundant computing technology represents another 
serious threat to the environment. The quantity of e-waste is growing at a 
tremendous pace. According to the UN's StEP Initiative, e-waste will soon 
weigh as much as eleven of the great Egyptian pyramids. 


Although great efforts are now being made to recycle more e-waste, large 
amounts are still shipped overseas to developing countries where they are 
dumped in landfill sites. This can have serious consequences for the environment 
and public health. The problem is compounded by the fact that the developing 
nations themselves are generating more and more waste of their own. 


E-waste that is not recycled properly can be a serious health and environmental 
issue. As you know, computer products contain a whole host of dangerous 
materials. Once a computer is dumped in a landfill site, the likelihood is 

that some, if not all, of these toxic substances will leak out into the ground, 
contaminating water supplies, infiltrating the food chain and polluting the air. 


For every one million mobile phones, 24 kg of gold, 250 kg of silver and nine 
tonnes of copper can be recovered. The presence of these valuable metals 
in old computing technology is a powerful incentive for local people living 
near the landfill sites, many of whom are desperately poor, to try to recover 
them. However, dismantling old computer equipment without protective 
clothing and specialist training is extremely dangerous. People who do so 
risk exposure to hazardous materials such as mercury and lead and are in 
danger of inhaling toxic fumes. 


Electronic waste is expected to top 60 million tonnes globally by 
2017 - an increase of a third in five years. 


‘One person’s cast-off is another person's treasured possession.’ 
Research, then briefly describe two initiatives that aim to prolong the 
life of pre-owned computing technology. 


The Waste Electrical and Electronic Equipment (WEEE) Regulations 

(2013) set targets for the collection, recycling and recovery of computing 
technology and other electronic items. They apply to businesses but not to 
individuals. 


The aim of responsible recycling is to recover valuable metals and reusable 
components such as plastic, glass and metal, and to dispose of dangerous 
substances safely. 


Major manufacturers of computing technology now have recycling 
programmes. In developing countries there are some promising initiatives 


to create state-of-the-art recycling plants that can turn e-waste into an 
e-opportunity. 
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1 Give three possible environmental impacts of using computing Green computing indicates 
technology. the efficient use of resources 


in computing. It makes 
companies and individuals 
responsible for producing 
and using technology with 
a minimal carbon footprint. 
The main emphasis is on 


2 Suggest one possible action that could be taken to reduce each 
of them. 


Preserving the environment 


The picture is not entirely bleak. Computing technology is at the heart energy efficiency, use of 
of efforts to combat climate change, provide disaster warnings, protect safer materials during the 
endangered species and habitats, and reduce energy consumption. manufacturing process and 


promoting environmentally 
friendly systems. 


Climate change 


NASA is analysing satellite data and measuring sea surface temperatures to 
learn more about how and why sea levels are rising. 


Networks of wireless sensor probes are used to gather information about 
glaciers. The probes are placed under the surface of the ice and measure 
temperature, pressure, stress, weather and sub-glacial movement. A base 
station collects the data from them. The system is helping scientists to 
understand more about the speed at which glaciers are melting. 


Researchers at the University of Oxford are using spare home computer 
time to establish if global warming is to blame for heavy flooding in the UK 
in recent years. 


Early warning 

Tsunami early warning systems use sensor networks to detect approaching 
tsunamis and a communications infrastructure to issue timely warnings so 
that coastal areas at risk can be evacuated. 


Conservation 

Information from GPS and satellites is being used to track Malaysian 
elephants. The results are analysed by computer to help improve 
conservation strategies and assess the effectiveness of the Malaysian 
Government's elephant conservation programme. 


Miniature transponders fitted to bees allow scientists to study the effects of 
disease and pesticides. 


Mobile phones are being used to listen out for illegal logging activities in the 
rainforest and provide rangers with real-time alerts. 


Energy 

Engineers at Manchester Metropolitan University are working on a project 

to make buildings more energy efficient. Sensors in each room monitor A lioness with a tracking device. 
light levels, temperature, how many people are present and electricity GPS collars help to monitor and 
consumption. Real-time analysis of the room data enables automatic conserve wildlife. 


adjustment of electricity usage. 
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The latest generation of giant solar energy farms uses sensors to track 

the movement of the sun and computer-controlled motors to adjust the 
position of the solar panels to optimise power generation. Developments in 
battery storage allows surplus electricity to be stored. 


Make sure you give real positive Discuss the impact of computing technology on the environment. 
and negative examples of how (6 marks) 

computing technology affects 

the environment. 


¢ Some of the materials used in the manufacture of computer components are non-renewable and in 
short supply. Others are dangerous and pose a risk to human health. 


° The Restriction of Hazardous Substances (ROHS) Directive restricts the use of hazardous materials in 
computing technology, forcing producers to find more environmentally friendly alternatives. 


¢ Computing technology consumes huge amounts of energy. Data centres are one of the worst culprits. 


Energy efficiency measures and use of renewable energy can significantly reduce the carbon footprint 
of computing technology. 


There is a possible health risk, especially for children, from exposure to the electromagnetic fields 
generated by wireless devices, such as smartwatches and smart clothing. 


Unregulated disposal of e-waste in landfill sites poses a significant threat to the environment. 


The Waste Electrical and Electronic Equipment (WEEE) regulations set targets for responsible recycling 
of e-waste. 


Computing technology is helping to preserve the environment in a number of ways, including monitoring 
and modelling climate change, conservation and smart energy. 


angthen | 


$1 Identify two hazardous substances used in the manufacture of computing technology. 

S2 Which UK law restricts the use of hazardous substances in the manufacture of computing technology? 

$3 Why is dumping e-waste in landfill sites harmful to the environment? 

S4 List two ways of reducing the environmental damage caused by data centres. 

S5 List two ways in which computing technology is helping to preserve the environment. 

Challenge = la 
C1 Summarise the health risks associated with the manufacture and disposal of computing technology. 

C2 Describe three ways in which computing technology can help to reduce energy consumption. 


How confident do you feel about your answers to these questions? 


If you're not sure you answered them well, reread this section and have 
another go at the activities. 
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6.2 Privacy 


Learning outcomes CN 


By the end of this section you should be able to: 


* explain why computing technology poses a threat to privacy. 


* weigh up the benefits and drawbacks of giving away personal information. 


* describe the legislation that protects against computer misuse. 


Now that you know about the damaging environmental impact that 
computing technology has, does it make you have second thoughts about 
swapping your smartphone for the latest model? Might you decide that your 
concern for the environment outweighs your desire for a new phone? If so, 
you are making an ethical decision. Ethics relate to what is right and wrong 
and govern a person’s behaviour. 


An action might be legal, but not necessarily ethical. For example, it’s perfectly 
legal to leave your old desktop computer gathering dust in the attic, but if you 
know that someone in a developing country would benefit enormously from 
having it, the right thing to do might be to dust it down and pass it on 

to them. 


Computing technology confers a wide range of social and economic benefits, 
but it also creates a host of challenging ethical issues. Privacy and security 
are two of them. 


While most people would agree that computing technology has helped to 
create a much more open society, some would argue that it comes at too high 
a cost. The amount of personally identifiable information that is gathered, 
stored and analysed represents a massive invasion of privacy. 


Personal data 

Every time you post an update on social media, sign up for an online 
account, use a web-based email service or a search engine you are adding, 
knowingly or unknowingly, to an enormous hoard of personal data that is 
held about you — where you live, what you look like, who your friends are, 
your likes and dislikes, your bank account details, products you're interested 
in buying, the route you take to school each morning. 


This personal data is stored on servers that belong to online services, such 
as Facebook and Google®, not to you. 


Every organisation you come into contact with, not just the online 
companies, is likely to collect information about you. Your school, for 
example, stores your attendance record, your end-of-year exam results, 
which books you've borrowed from the library, the after-school activities you 
take part in and much more besides. Does this worry you? 
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Ethics: a set of moral principles 
that govern a person's behaviour. 


Privacy: the right to be left 
alone and free from unwanted 
scrutiny and intrusion. 


Personal data: information that 
is personal and unique to an 
individual. 


Facebook has the biggest 
database of faces in the world, 
with over 350 million photos 
posted and tagged to the 
website every day. 


Identity theft: the stealing 

of another person’s personal 
details, such as their bank account 
number, sort code or passport 
number, for the purpose of 
making purchases and running up 
debts in their name. 


Some people are very concerned about the amount of personal information 
that is collected, often without their consent and over which they believe 
they have little or no control. They are worried about who has access to it, 
what they are using it for, how secure it is and how accurate it is. 


Weak security could result in personal information falling into the wrong 
hands, making people vulnerable to phishing attacks, scams, identity theft 
and fraud. 


Sometimes the information is inaccurate, but getting it changed or removed 
is extremely difficult, if not impossible. {t's not unheard of for inaccurate 
information about a person to follow them throughout their entire life, 
affecting how they are seen and treated by others. 


The UK Data Protection Act (1998) controls how organisations and the 
government can use personal data. It specifies the following principles. 


¢ Data must be processed fairly and lawfully. 


¢ Data must be obtained and used only for the specific and lawful 
purposes for which it was collected. 


¢ Data must be adequate, relevant and not excessive. 
¢ Data must be accurate and up to date. 

¢ Data must be kept for no longer than necessary. 

¢ Data must be kept secure. 


e Data must not be transferred to regions not bound by similar 
principles of the Act. 


Activity 5 ? 


1 What information can you find out about yourself by typing your 
name into a search engine? Is it accurate? What sort of impression of 
you does it portray? 


2 Research the Safe Harbor Decision privacy principle. Why hasthe EU 
declared it invalid? | 


One reason why so many people voluntarily give away information about 
themselves is that it enables an organisation to understand their needs 
better and provide them with a more personalised service. For example, 
setting up an account with an online supplier makes it faster and more 
convenient to purchase from them. 


But do these benefits outweigh the drawbacks? 


You might not mind that an online retailer knows who your favourite band 
is if it means you get to hear quickly when their next album is released, but 


is it right to target a financially vulnerable person with adverts for products 
they will want but can’t afford? 
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Cookies are small data files that keep a record of your web browsing 
history. They record which websites you visit and how often, which 
products or services you buy or show an interest in. Cookies enable 
online stores to learn a lot about you. 


The Privacy and Electronic Communications Regulations (2011) gives 
consumers the right to opt out of having data about their browsing 
habits collected in this way. 


Unlike a cookie, spyware is a computer program stored on your hard 
drive (usually without you realising it’s there) that collects information 
about you and transmits it to a third party. It represents another 
serious threat to privacy. 


Big data 

Data analysts are able to learn more and more about us and gain insights 
into our behaviour by analysing huge volumes of personal data gathered 
from various sources. 


Analysis of so-called ‘big data’ can benefit society. For example, by helping 
to identify adverse side effects of drugs that might otherwise go unnoticed, 
optimising energy use in cities and providing insights into the spread of disease. 


But is the price we pay too high? Where do we draw the line? Big data 
comprises large amounts of information, each piece of which on its own could 
be seen as being harmless, for example your phone number, what music you 
download, your hobbies, etc. However, when collated together these individual 
bits of information produce a very accurate, detailed profile of an individual, 
revealing far more about them than they might have willingly disclosed. This 


might lead to the individual becoming a victim of identity theft or an intruder Find two examples of how 
illegally accessing personal information through social engineering. (See , society is benefiting from big 
‘Personal data’ on page 213.) data analysis. 4 


Surveillance 
Have you any idea how often you've been watched on CCTV today? Could 


a drone have been hovering overhead taking aerial photographs of you on 
your walk to school? If you've driven anywhere by car, travelled by public ‘Keyterms 


transport or been in a shop, the chances are you've been recorded by some Surveillance technology: 

form of surveillance technology. CCTV, drones, number plate 
Most people are willing to allow the security forces to use surveillance recognition, bugging and 
technology to track people’s movements and tap their phones if it enables tracking devices used to monitor 
them to uncover terrorist plots. But what if it were used by companies to and record people’s activities, 
monitor your shopping habits or by criminals noting the time you leave often without their knowledge. 
the house each morning? it’s not unheard of for employers to use hidden Wi ctletblower <omecna une 
cameras to check up on their staff. Is this acceptable? draws attention to the activities 
Some people believe that use of surveillance technology goes too far. In of an organisation or person 
2013 Edward Snowden, a so-called ‘whistle-blower’, raised awareness of the believed to be acting illegally 
extent to which governments worldwide are now monitoring and spying on or unethically. 


their citizens. 
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The right to privacy must be 
balanced against the needs of 
society. Describe two situations 
in which you believe an 
invasion of privacy is justified. 
Explain your reasoning. 


Location-based services: 
services that enable people 
to access and share real-time 
location information online. 


Activity 8 ? 


Describe three ways in which 
location-based services can 
benefit a user and three risks 
associated with their use. 


Encryption 


_ Cookie cleaners, anti-spyware and ad 
| blockers 
Identity management services 


| Password managers 


UK police track vehicle movements in real time using automatic 
number plate recognition and CCTV. Records can be kept for two years 
to be analysed for intelligence. 


CCTV combined with facial recognition software enables the police to 
identify and track specific individuals. 


Location-based services 

With the help of location-based services and Wi-Fi, people can share 
their current location, arrange to meet up with friends nearby, check in to 
a venue, find their way to a particular location and much more. A drawback 
is that location-based services also allow other people to track your 
movements, find out where you live and what you are doing. This can be 
dangerous and represents a huge invasion of privacy. 


- Privacy-enhancing tools 


Privacy-enhancing tools, while not 100 per cent effective, do give some 
protection against privacy invasion. This table lists some of the most 
popular of these tools. 


Prevents unauthorised people from reading your data. 


Software that detects and removes cookies, spyware and adware installed on 
your computer. 


' A trusted third party holds evidence of your identity and issues you with 
an identifier that enables you to conduct transactions with other parties 
without revealing any personal information about yourself. 


Stores all your website login information in an encrypted password 
database with a master password, which is the only one you have 
to remember. 


Cyber-security 


As you learnt in Chapter 5 (see page 192), hacking represents a serious 
security threat. 


The Computer Misuse Act (1990) makes hacking a crime. It identifies three 
types of illegal activity. 


* Unauthorised access to computer material, either a program or data. 


* Unauthorised access with intent to commit further offences (e.g. 
accessing personal data about a person so as to steal their identity). 


* Intentional and unauthorised destruction of software or data (e.g. by 
installing malware). 
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A 2015 amendment to the Act grants immunity from prosecution to the ‘Didyouknow? 
security services, enabling them to hack data on laptops and mobile phones, 

and in databases belonging to suspected criminals. Sony was the victim of a major 
cyber attack in 2014. The 
company claims that hackers 
Activity 9 working for the North Korean 
Government were to blame. 
Private emails were disclosed 

to the public, and a number of 


Research the ‘Carphone Warehouse Data Breach’ of 2015. How many 
customer records were thought to have been stolen? What are the 


" implications of this breach for both the customers and the company? 
—— ~- —: na alg ot unreleased films were made 


available on sharing websites. 


* Computing technology enables organisations to gather, store and analyse vast quantities of personal 
information about the people they come into contact with. 


e Individuals give away all sorts of personal information about themselves online. 


¢ Collecting and analysing information about the people they come into contact with enables organisations 
to provide a more personalised service. 


¢ Big data analysis benefits society at the expense of many individuals’ privacy. 


¢ Surveillance technology helps keep us secure, but encroaches on our privacy. It is difficult to determine 
what level of surveillance is acceptable. 


¢ The Computer Misuse Act (1990) makes hacking illegal. 


S1 Describe two ways in which an individual's personal data could end up stored in databases owned by 
a third party. 

$2 What might happen if personal information falls into the wrong hands? 

$3 List two privacy-enhancing tools and describe what they do. 

S4 List the activities that the Computer Misuse Act (1990) makes illegal. 


C1 Why do some people decide that the benefits of revealing personal information about themselves 
outweigh the drawbacks? 
C2. Describe a situation where the right to privacy is less important than the needs of society. 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, reread this section and have another go at 


the activities. 


217 


6.3 Digital inclusion 


Learning OULCOmes 


By the end of this section you should be able to: 
¢ explain how people benefit from being ‘technology-empowered’ and the disadvantages of being 


‘technology-excluded’. 


¢ describe measures that are being taken to promote digital inclusion. 


Digital inclusion: ensuring 


that everyone has affordable 
access to computing technology 
and the necessary skills to take 
advantage of it. 


Digital divide: the gap between 
people who are technology- 
empowered and those who are 
technology-excluded. 


Computing technology is a great enabler, giving many people access to 
news, information, products and services at any time wherever they are. But 
those people who neither have the opportunity nor knowledge to use this 
technology are excluded from the advantages it provides. Is this fair? 


Digital inclusion is about providing everyone with affordable access to 
computing technology and the skills to use it. 


The gap between those who are ‘technology-empowered’ and those who are 
‘technology-excluded’ is known as the digital divide. 


There is a digital divide between industrialised and developing countries 
and also between people who live in the same country. 


Impact 


There are many reasons why technology exclusion is not a good idea. 


Information and | The internet is becoming the default option for accessing 
services = =—_—_ information, public services and entertainment. 
‘Employment | Having poor digital literacy skills makes it harder 

_to find a job and limits employment opportunities, 
. relegating individuals to poorly paid work with little 
| a ____ prospect of progression. | : 
_ Democracy The internet gives people a voice and lets them 


| 


| _ express their views to a worldwide audience. This is 
| particularly important where citizens have limited 
freedom of expression. 


Economic growth | Businesses that are able to exploit computing 
| technology to the full have a competitive advantage 
over those that can't. 


| 
— —_ , — | 
Saving money _ Paying bills and shopping online often saves | 
consumers money and gives them better protection. | 


Social isolation Having access to the internet helps people to keep in 
' touch with friends and relatives. | 


Towards digital inclusion 


Data from the Office for National Statistics suggest that the UK is making 
good progress towards achieving digital inclusion for its citizens. Figures for 
2015 for England, Scotland and Wales show that: 
¢ the internet was accessed every day by 78 per cent of adults 

(39.3 million), compared with 35 per cent (16.2 million) in 2006; 
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a Digital inclusion 


* 96 per cent of adults aged 16 to 24 accessed the internet ‘on the gO, 
compared with only 29 per cent of those aged 65 years and over; 


* 61 per cent of adults used social networking and, of those, 79 per cent did 
SO every day or almost every day; 


° 76 per cent of adults bought goods or services online: 


* 86 per cent of households (22.5 million) had internet access, up from 
57 per cent in 2006. 


Other industrialised nations in North America, Europe and Northern Asia are 
also doing well. 


The same can't be said for other parts of the world. According to the United 
Nations’ State of Broadband report (2015), billions of people living in the 
developing world are still without broadband internet, including 90 per cent 
of those living in the poorest nations. 


Age, disability, disinterest, poverty and cultural norms all play a part in 
digital exclusion, but lack of connectivity is one of the major causes. 


The good news is that, according to the World Bank. 77 per cent of the 
world’s population already live within range of a mobile phone network. In 
areas with a limited or non-existent landline infrastructure, mobile phone 
technology can fill the gap. Even though the number of phones per 100 
people in poor countries is much lower than in the developed world, they 
are having a huge impact. 


Did you know? 


Facebook plans to use drones and satellites to bring the internet to 
Africa. They have also developed an app, called Free Basics, which 
provides free basic access to services. 


Activity 10 eee 
Use the internet to find some actual examples of how mobile phones 
are being used in Africa or elsewhere to promote digital inclusion. 

Write a brief report summarising your findings. J 


Efforts to bridge the digital divide require more than simply giving people 
internet access. The UK's digital inclusion strategy sets out the actions that 
the government and its partners are taking to reduce digital exclusion. 


Ambitious digital literacy programs are 
under way in India, Kenya, Colombia 


have the know-how and skills they need 


ot igital exclusion. 
| Identify five actions a government can take to reduce digi al on, | i ciniedicialacctnsien @ etal 
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* Digital inclusion means that everyone has affordable access to computing technology as well as the skills to 
use it. 
¢ Anumber of factors contribute to the digital divide — lack of or poor connectivity is one of the main ones. 


¢ Someone who is ‘technology-excluded’ misses out on all the opportunities computing technology offers. 


$1 What is meant by the terms ‘technology-empowered’ and ‘technology-excluded’? 

S2_ List three drawbacks of being ‘technology-excluded’. 

$3 List four factors that contribute to the digital divide. 

S4 Describe two ways of providing access to the internet in areas with a poor landline infrastructure. 


C1 Access to the internet is a key factor in reducing the digital divide. Describe two further measures that 
governments can take to promote digital inclusion. 


How confident do you feel about your answers to these questions? If you're 


not sure you answered them well, reread this section and have another go 
at the activities. 
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6.4 Professionalism 


Learning outcomes 


By the end of this section you should be able to: 
* explain what professionalism means in the context of computer science. 


Computer scientists write software to make computers do new things or 
accomplish tasks more efficiently, create mobile apps, design and build 


embedded systems, devise security policies, invent new products and much 
more besides. 


Some work for big multinational computing companies, such as Microsoft® 
and Apple, others for small start-ups; some are self-employed, some 

are employed in the IT departments of organisations such as hospitals, 
universities and companies. 


Wherever they work they are expected to behave ethically and demonstrate 
professionalism. 


The British Computer Society (BCS) is the Chartered Institute for IT. The 
BCS Code of Conduct sets out the professional standards its members are 
expected to uphold. Among many other matters, it specifies that computer 
scientists must: 


* respect the privacy, security and wellbeing of others and the 
environment; 


avoid injuring others, their property, reputation, or employment; 


develop their professional knowledge, skills and competence ona 
continuing basis; 


be familiar and comply with relevant legislation; 


not disclose confidential information; 


not misrepresent or withhold information on the performance of 
products, systems or services. 


Airtest produces exhaust emissions testing software. A programmer 
discovers that there is a bug in the software that produces inaccurate 
results under particular circumstances. 

State what course of action the programmer should take and explain why. 
(3 marks) 


This is a scenario-based question, so make sure you relate your answer to the 
scenario — don’t mention what you personally would do, but what the Airtest 
programmer should do. Don’t forget to refer back to the code of conduct. 
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Professionalism: the skill and 
competence expected of a 
person in a professional setting. 


Most computer scientists pay an 
annual membership fee to belong 
to a professional association. This 
gives them access to specialist 
technical conferences, training 
and publications. It also provides 
them with an opportunity to 
interact and share knowledge and 
expertise with each other. Having 
membership of a professional 
association on their Curriculum 
Vitae (CV) sometimes carries 
weight when a computer scientist 
is applying for a new job. 

The three main professional bodies 
for computer scientists in the UK are: 


e The British Computer Society 
(BCS) 

¢ The Institute of Electrical and 
Electronics Engineers (IEEE) 


¢ The Association for Computing 
Machinery (ACM). 


A computer scientist is developing an embedded system to control 
car brakes. 


Discuss in a group how she should respond if: 


1 


3 
4 


she spots a bug, but knows that fixing it will delay completion of 

the project; 

she’s aware that there is a remote possibility that someone could hack 
into the code and stop the brakes from working; 

a competitor offers to pay her for details of the code; 

her employer makes claims for the system that are not entirely true. 


Discuss any other ethical dilemma she might encounter as a professional. 


¢ Computer scientists must abide by the law. 


¢ They should behave ethically by adhering to a professional code of conduct. 


¢ It is important that programmers demonstrate professionalism as the work they do could put the lives of other 
people at risk. 


Strengthen 


S1 What does the BCS Code of Conduct say a computer scientist should do? 


Challenge 


C1 What does ‘professionalism’ mean for a computer scientist? 


How confident do you feel about your answers to these questions? If you're 
not sure you answered them well, reread this section and have another go at 
the activities. 
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6.5 The legal impact 


Learning objectives 
By the end of this section you should be able to: 


* describe how copyright and patents can protect intellectual property. 
° explain the purpose of a software licence. 


* differentiate between open-source and proprietary software and outline the benefits and drawbacks of each. 


Intellectual property 


Intellectual property (IP) — not to be confused with an IP address — isa 
unique creative product of a human mind. A piece of software, a computer 
game, a design for a new processor, a digital image, a piece of music and a 
literary work are all examples of IP. Each of them was created by somebody, 
is unique and has a commercial value. 


Copyright and patents 


The Copyright, Designs and Patents Act (1988) makes it illegal to copy, 
modify or distribute intellectual property without permission. 

That said, there are a multitude of peer-to-peer networks, torrent sharing 
websites and forums on the internet that allow people to download 
copyrighted software without paying for it. Not only is this illegal, it is also 
unethical, since it means that the programmer who wrote the code doesn’t 
get the money that is due to them. 

Copyright only protects the expression of an idea, not the idea itself. So if 
you were to develop an original piece of software, its source code would be 
protected, but there’s nothing to stop someone else from copying the idea 
and writing a program that essentially performs the same task. You would 
have to prove that the similarities between the two programs are more than 
just coincidence and can only be explained by copying. 

The © symbol indicates that a piece of software, a movie or some other type 
of artefact is protected by copyright. 


In 1994, Apple agreed to license parts of its GUI (see section 4.4, page 
162) to Microsoft® for use in the first version of Microsoft® Windows®. 
When Microsoft® released Windows® 2.0, it used the overlapping windows 
feature of the Macintosh OS, which was not included in the original 
licence agreement. Consequently, Apple sued Microsoft® for copyright 
infringement. 


A patent offers more protection than copyright. It protects the idea or 
design of an invention, rather than just a particular form of it. In order to get 
a patent you have to be able to demonstrate that what you have invented 

is distinct from anything else that already exists. A patent holder has the 
exclusive right for 20 years to make, use and sell their invention. 
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Intellectual property (IP): a 
creation of the human mind 
that is unique and has a 
commercial value. 


Patent: an exclusive right 
granted to an inventor to make, 
use and sell an invention for a 
fixed period of time. 


The creators of the illegal 
software sharing website, The 
Pirate Bay, received a prison 
sentence and were fined over 
US$4,000,000 for hosting 
hyperlinks to illegally obtained 
software and media. 


Make sure you know which laws 
affect the use of computing 
technology, including the Data 
Protection Act (1998), the 
Computer Misuse Act (1990), the 
Copyright, Design and Patents 
Act (1988) and the Regulation of 
Investigatory Powers Act (2000). 


Keep an eye out for any 
amendments to these laws or 
for relevant new government 
legislation. 


For years, various smartphone 
manufacturers including Apple and 
Samsung have been battling in the 
courts over patent infringements. 
Apple alleged that Samsung had 
stolen the ‘look and feel’ of its 
iPhone and used it in the Galaxy 
smartphone. 

There is a real concern that 

the inventiveness that patents 

are designed to encourage will 
suffer as a consequence of these 
so-called ‘patent wars’, with 
manufacturers spending their 
money on lawsuits rather than on 
new inventions. This would be very 
bad news for consumers. 


1 Get into groups of three. Each 
person in the group should 
describe how they would feel 
if an app they have created 
was made available for free 
on the internet, without their 
permission. 


2 Create a podcast for software 
developers, explaining how 
software can be protected as 
intellectual copyright. 


Creative Commons: an 
organisation that allows people 
. to set copyright terms for their 
intellectual property. One use 
of a Creative Commons licence 
is to allow people to copy 
material as long as it is not used 
commercially. 


Open-source software: 
software that is free to edit and 
redistribute. 


Assess the extent to which the patent system is a barrier to technological 
innovation. (4 marks) 


The command word here is ‘assess’, which means you should consider 
both sides of the argument and come to a conclusion. 


Licensing 

Every piece of software, even if it is free, has a licence. Even though the 

user purchases a piece of software, the licence states that they don’t 

actually own it. The licence allows the buyer to use the software subject 

to the licence terms, but the manufacturer retains ownership. Before you 

can install the software you have to agree to the terms of its licence. These 

specify: 

* how many copies of the software you are allowed to use; 

¢ whether you can install the software on more than one computer; 

¢ what type of organisation can use the software — some licences are for 
charities, students or home users only; 


¢ how long the software can be used for — perpetual software licences last 
forever, but some licence agreements expire unless you renew them. 


You are usually not allowed to resell the software. Paid-for software is often 
supplied with a unique licence key, which certifies that the software is 
genuine and prevents illegal copying. 


If a computer scientist wants to permit other people to use their code 
without charge, they can use an open-source licence to specify what 
restrictions (if any) there are. 


A Creative Commons licence provides a way for the creator of a piece 
of music, a photograph or other form of intellectual property - including 
software - to allow other people to use it providing they abide by the 
conditions specified in the licence. 


Open-source and proprietary software 


Open-source software is freely available on the internet. Anyone is 


permitted to edit the code and pass it on to others, providing they don’t 
charge a fee. 
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The advantages and disadvantages of open-source software are shown in 
this table. 


The Python programming 
SS msc language is open source and 


It is free to use. It might not be particularly ‘user friendly’ is maintained by the Python 
° It can be modified. and might look unprofessional. | Software Foundation. Members 
' : ; f the programming community 
° ¢ There might be | ; 
on | ee MET Weesaalatcs RA] o10. § often contribute to the source 
rate 


code to build on the language. 
programming concepts. ¢ Criminals may be able to identify and 


exploit vulnerabilities in the code. 


Proprietary software is the opposite of open source - it is closed source. This _| Proprietary software: software 
means its source code is protected and users are not allowed to modify it. | that belongs to an individual or 
; acompany. Its licence specifies 

_ that users are not allowed to 

_ modify the source code and 

. places restrictions on its use. 


On the plus side, proprietary software is extensively tested prior to release, 
any bugs that do come to light thereafter are quickly fixed and there is 
plenty of user support. The drawback is that if the software doesn’t exactly 
do what you want it to, you're not allowed to change it. 


Summary zz 


¢ A programmer loses out financially if their software is downloaded Microsoft® Windows® 10, iTunes, 
illegally. Adobe Photoshop and Mac OSX 

* The Copyright, Designs and Patents Act (1988) makes it illegal to copy, are all examples of proprietary 
modify or distribute intellectual property without permission. software. 


¢ Copyright protects the expression of an idea, not the idea itself. 

¢ A patent offers more protection than copyright. It protects the idea of 
an invention rather than just a particular form of it. 

e A software licence specifies how a piece of software can be used. 

¢ Open-source software is freely available and can be edited and shared. 

¢ The source code of proprietary software is protected and users are not 
allowed to modify it. 


Create a table that summarises 
the differences between open- 
source and proprietary software. 


S1 Describe the purpose of a software licence. 
S2 What are the main differences between open-source and proprietary software? 


C1 Explain how a patent would protect you as an inventor of a product. 
C2 Identify relevant legislation that computer scientists should be familiar with. 


How confident do you feel about your answers to these questions? If you're not 
sure you answered them well, reread this section to enhance your knowledge. 


225 


Preparing for your exam 


This section of the book will help you to prepare 

for the external assessment. GCSE (9-1) Computer 
Science has two written examination papers and a 
project. You must carry out the project in the final 
year of your course and sit both examinations in your 
final term. 


Paper 1 - Principles of Computer Science 


This paper is worth 40 per cent of your GCSE 
Computer Science qualification and is 1 hour and 

40 minutes in length. It is designed to test your 
knowledge and understanding of the principles of 
computer science, and covers aspects of all six topics 
listed in the specification. 


The number of questions in the paper can vary, but 
you must try to answer all of them. Most questions 
have some straightforward parts and some more 
challenging ones; even if you can’t answer all of a 
question you should be able to tackle at least parts 
of it. 


Understanding the questions 


Each question begins with a short introduction. 
Don't be tempted to skip these as they give guidance 
as to the sort of answer expected. You could miss 

out on marks by giving generic responses when a 
contextualised answer is required. 


It's important to pay attention to the command words 
at the start of questions. These indicate what sort of 
answer is needed and how much depth you should go 
into. The meaning of some frequently used command 
words is described below. 


State, give, name, identify, list. Recalling facts. 
For example: ‘State what is meant by the term 
overflow. or ‘Give one reason why programmers use 
subprograms. A simple statement will usually suffice 
as an answer. 


Describe. Giving a description of something. For 
example: ‘Describe how binary digits are used to 
represent bitmap images.’ Use the number of marks 
allocated as a guide to how many distinct points you 
should make. 
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Explain. Providing points that are linked to a 
justification. For example: ‘Explain why increasing 
the size of the cache will improve a computer's 
performance. This requires you to describe the 
purpose of the cache and then go on to explain why 
its size makes a difference. 


Calculate. Carrying out a calculation. For example: 
‘Calculate the file size in bytes of a 24-bit colour 
image that has a width of 600 pixels and a height of 
400 pixels.’ Remember that you are not allowed to 
use calculators. Instead you should show the relevant 
stages involved in carrying out the calculation. 


Compare. Observing similarities and differences 
between two or more alternatives. For example: ‘Rosie 
is developing a program to control a smart lighting 
system. Compare using an interpreter with using a 
compiler to translate the program into executable 
code.’ In this instance it’s advisable to take account of 
the context in your answer. 


Evaluate. Making critical judgements by considering 
relevant information. For example: ‘Evaluate the use 
of a “divide and conquer” strategy for sorting and 
searching data. 


Discuss. Exploring an issue, situation or problem. 
For example: ‘Discuss the ways in which computing 
technology can make a positive contribution to 
household energy consumption. You must structure 
your response and use appropriate evidence to 
support your conclusions. 


Show. Show the sequence of steps involved in 
applying a bubble sort algorithm to this list of 
names: “William”, “Rhys”, “Maddie”, “Greta”, “Edward”, 
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“Daani”, “Anja”. 


There will be at least one six-mark extended writing 
question in this paper. Make sure you allow enough 
time to plan your answer and try to use correct 
terminology where appropriate. 


Write your answers in the spaces provided on the 
paper. Try not to use more space than is provided and 
if you do write your answer somewhere else make 
sure it is clearly labelled. 


Preparing for your exam 


Sample answers with comment 
Here is a sample answer to a Paper 1 style question. 


Smarts Leisure owns a chain of fitness centres throughout the north-west. Its head office is in Stockport. 
Each fitness centre has its own wireless LAN, All the fitness centres are connected to head office via a WAN. 


1(a) (i) Identify two ways in which a LAN differs from a WAN. (2 marks) 


1 LAN on one site. 
2 WAN covers a wider geographic area. 


(it) Explain how devices are connected on a wireless network. (3 marks) 


Radio waves are used to connect devices on a wireless network. Each device has a wireless network 
adapter that sends and receives radio signals. A router broadcasts a wireless signal that devices can 
detect and ‘tune’ into. 


1(b) Files are divided up into packets before transmission. Two fields in a network. packet are source 
and destination. 


Give two additional fields found in a network packet. (2 marks) 


1 Sender 
2 Sequence Number 


1(c) Members of a fitness centre can log on to Smarts Leisure’s website to view details of their training. 


Explain the server-side processing that takes place when a member views their 
training online. (3 marks) 


The server receives the request from the client computer and retrieves training data from the database. 
It sends data to the client computer. 


1(d) Smarts Leisure is concerned about cyber-security. 
Discuss what measures Smarts Leisure should take to reduce the security risks from social 


engineering, (@ marks) 


Social engineering involves tricking people into revealing confidential information. Hackers use a variety of 
methods to get people to give away confidential information. 


An attacker could call the Smarts Leisure help desk pretending to be an employee who has forgotten their 
password, Staff might be fooled into helping them to log on remotely and reset their password. 


Smarts Leisure must make sure that all their employees are aware of this kind of attack and know that 
they should always verify a caller's identity before giving them access to the network or parting with 
sensitive information over the phone. 


227 


Preparing for your exam 


Emails that tempt users to click on a weblink are another risk. Staff might be fooled ino filling in a 
web form that asks for personal information. Or without realising it they might allow Trajan software 
to be secretly installed on their computer, which will capture sensitive information and email it to 
an attacker. 
Smarts Leisure must ensure that all their employees are aware that they must never reveal personal or 
sensitive information in response to an email, no matter who appears to have sent it. Hackers sometimes 
visit a company posing as a bogus employee or visitor. If they ave allowed in, they can look for information 
lying on desks, listen in to conversations, plant a key logger or connect a laptop to the network. 
Smarts Leisure has a lot of people going in and out of their fitness centres. This makes it more difficult 
to prevent unauthorised access. They should issue their employees with passes and restrict access to the 
office area to people with passes. Confidential information should never be left on desks and screen locks 
should be activated on all computers when not in use. 
Smarts Leisure should consider employing someone to carry out penetration testing to identify which of 
its employees are vulnerable to social engineering. These employees can then be given training to help 
them spot an attack and respond appropriately. 

(Total for Question 1 = 16 marks) 
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between a LAN and a WAN, not the two 


that were required. Each answer should They could have said that a LAN is relatively 
compare a LAN with a WAN. inexpensive to set up and run, whereas a WAN 
requires costly hardware and dedicated leased 
1(a) (ii) The candidate has made three lines. Using ‘whereas’ is a good way of making sure 
distinct points, demonstrating sound you compare one thing with another. 
understanding, 


1(b) There are lots of possible answers to this question, but you would get no marks for giving 
source or destination as an answer, since these are provided in the question. ‘Sender’ is just 
another name for ‘source’ so in this case the candidate only identified one additional field. 


1(c) This question is worth three marks, suggesting that the examiner is expecting three 
distinct points to be made. The student has only given two points so loses out on a mark. 
They omitted to say that the server constructs 
a web page to display results of a search. 


Your answers should take account of the context of 
the question. You could miss out on marks by giving 
generic responses. 


They have, however, related their answer to 
the scenario, which is a sensible thing to do. 


1(d) This question requires an extended answer. 
The student has selected relevant knowledge and demonstrated good understanding of key 
concepts. They have produced a well-structured, coherent response, identifying risks and 


suggesting appropriate measures. Their answer takes account of the context, giving advice 
directed at Smarts Leisure. 
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Paper 2 - Application of Computational Thinking 


This paper is worth 40 per cent of your GCSE Computer Science qualification. It focuses on topics 1 and 2 of the 
subject content of the specification and assesses your computational thinking and problem-solving skills. It is 
2 hours in length, which means you have plenty of reading and thinking time, as well as a chance to check your 


answers, especially the algorithms. (It’s easy to overlook logic errors and to use the wrong comparison operator 
in an expression.) 


Calculators are not allowed but you can use a stencil to help you draw flowchart symbols. 


Pseudo-code 


Some of the questions will use pseudo-code. It’s a good idea to familiarise yourself with the Edexcel pseudo- 
code before the exam. But don’t worry if you don’t remember alt the commands. You will have a clean copy of 
the pseudo-code booklet to refer to in the exam. You don't have to use the Edexcel pseudo-code in any of your 
answers. Any version is acceptable providing it is unambiguous and easy to follow. 


Understanding the questions 


You should familiarise yourself with the scenario that appears at the start of the paper. It sets the context for 
the whole paper. Further information is given at the start of each question. Read this carefully. In some cases, 
you won't be able to answer the question without referring to it. 


All the questions are compulsory. They are split into a number of parts, some more straightforward than others. 
Here are some of the command words you might encounter in this paper. 


Amend. Making changes. For example, amending an algorithm so that it performs differently or in order to 
correct an error. 


Assess. Judging or deciding the value, or appropriateness, of something. For example, assessing the 
appropriateness of a particular data structure. 


Complete, fill in. Filling in gaps. For example, completing a flowchart or a table. 
Construct, draw. Creating something. For example, constructing a flowchart, an expression or a validation test. 
Write. For example, a line of pseudo-code or an algorithm. 


Longer answers in this paper are likely to involve creating an algorithm expressed either in pseudo-code or as 


a flowchart. 
Calculators aren't allowed in this exam. If you're Look at the number of marks allocated to the 
no good at mental arithmetic, then you can at question for an indication of how many separate 
least show how the answer would be worked out. points you need to give. 
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Sample anfwers with comment 


Here are sample answers to Paper 2 style questions. 


1 The Riverside is a large, out-of-town shopping mall with shops, restaurants and leisure facilities. 


(a) The rent that retailers pay depends on the area of their unit (in square metres) and the 
number of years they have been there. 


| The rental charge is £100 per square metre, with a discount of 1 per cent for each year 
of occupancy. Construct a general expression showing the rent that a retailer will 
have to pay. (2 marks) 


rent = unitSize * (100 — years) 


(b) The Riverside wants to limit the discount it gives to no more than 10 per cent. 


Write an algorithm expressed in pseudo-code to calculate the rent for each retailer, 
taking into account the maximum discount. _ (2 marks) 


IF years >= 10 THEN 
rent = unitSize * 90 
ELSE 
| rent = unitSize * (100 — years) 
IFEND 


2 The Riverside produces visitor statistics. Data about the number of people who visit the shopping 
mall each day is collected and analysed. 


Complete the table to show an input, process and output. (2 marks) 


INPUT | PROCESS OUTPUT 


Number of visitors on each day | Calculate overall total number of | Sum of visitor numbers for the 
for one year | visitors in a year year 


| | Number of visitors on each day | Calculate overall total number Month with the highest number 
| | for one year | of visitors in each month. Sort of visitors 
|| | month totals to find month with 

the highest number of visitors 


3 The number of security and medical staff required each day depends on how many visitors 
are expected, 


The pseudo-code for a subprogram that calculates the number of security and medical staff 
required is shown. 
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re 
en SET days TO [80000, 30000, 30000, 50000, 55000, 60000, 100000} 
bad _ SET useAgain TO ‘no’ 

5 WHILE useAgain = ‘yes’ DO =“ 
! 6 SEND ‘Enter day number (Sunday = 0)’ TO DISPLAY yi 
Uy : RECEIVE dayNumber FROM (INTEGER) KEYBOARD 

8 SET visitors TO days[dayNumber] [= 

10 | SET security TO visitors DIV 1000 
haa | SET medical TO visitors DIV 20000 a 

12 | | 
(3) SEND ‘Do you want to enter another day (yes or no)?’ TO DISPLAY 

14 RECEIVE useAgain FROM (STRING) KEYBOARD 

15 | END WHILE _ mil 
16— END staff required 


(a) There is a logic error in the algorithm. 
Identify the error and explain how it can be corrected, (2 marks) 


The variable useAgain is initialised to ‘no’ before the start of the while loop, so the loop never executes. 
The error can be corrected by amending the statement in line 4. 


(b) Explain why the operator DIV is used in lines 10 and 11 rather than the divide (/) 
operator. (2 marks) 
Only whole people are needed. 
(c) Complete line 12 s0 that a sentence is displayed on the monitor stating: 


The number of security staff required is *** and the number of medical staff is ***,’ 
(2 marks) 


SEND ‘The number of security staff required is' + security + ‘and the number of medical staff is’ 
+ medical TO DISPLAY 


(d) Using the algorithm, calculate the numbers of security and medical staff needed on 
a Thursday. (2 marks) 


Number of security staff needed: 


| Number of medical staff needed: 


(e) Explain the benefits of using subprograms when — algorithms, (3 marks) 


Makes main program clearer and easier to understand and shorter. No repeated blocks of code — can 
be reused from anywhere in the program — easier to test — can be reused in other programs. 
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The Riverside issues customers with a loyalty card, which they can use to get a discount in any of the shaps. 
The discount varies according to how much they have already spent in the current year. if they have spent 
between £250 and £500 the discount is 5 per cent, If they have spent £500 or more it ts 10 per cenz. 


Complete the flowchart r22==---- 1 (6 marks) 
to show how the discount | 
is calculated, 


Offer customer a 


di loyalty card 


4 Input 
thisSale 


. toPay = thisSale End 
2 


toPay = thisSale - 5% 


thisYear = thisYear 
+ thisSale 


Riverside tries to keep the temperature constant at 20°C and the humidity at 50 per cent by opening 
air vents if it is too warm or too humid and closing them and using heaters when it is too cold, 


Temperature and humidity sensors ave located at various positions within the shopping mall. The 
external temperature is also monitored, 


The air vents are operated by electric motors, which rotate a bar clockwise to open them and 
anticlockwise to close them. 


If the temperature falls below 20°C the air vents are closed and the heaters are switched on. If the 
external temperature is 26°C or above and the air vents are open, then the air vents are closed and 
the air conditioning units are switched on. 


(a) Complete the table to give the inputs and outputs of this system. (G marks) 
Inputs ult Outputs 
Internal temperature Alr vent motors 
Humidity | Heaters 
External temperature Air conditioning unit 


(b) Write an algorithm to control the temperature and humidity, Use pseudo-code. (7 marks) 
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SET heatersOn TO False #initialises variables 

SET airVentsOpen TO False 

SET airConOn TO False 

SET riversideOpen TO True 

WHILE riversideOpen = True DO #loops while shopping mall is open 
RECEIVE inéTemp FROM (REAL) SENSOR 
RECEIVE humidity FROM (REAL) SENSOR 
RECEIVE extlemp FROM (REAL) SENSOR 


IF intTemp > 20 OR humidity > 50 THEN 
IF heatersOn = True THEN 
SEND ‘switch off signal’ TO HEATERS 
SET heatersOn TO False 
END IF 
SEND ‘clockwise signal’ TO AVMOTORS #opens air vents 
SET airVentsOpen TO True 
END IF 
IF intTlemperature < 20 THEN 
IF airVentsOpen = TRUE THEN 
SEND ‘anticlockwise signal’ TO AVMOTORS ##closes air vents 
SET airVentsOpen to False 
END IF 
SEND ‘switch on signal’ TO HEATERS 
SET heatersOn TO True 
END IF 
IF extTemp >= 20 AND airVentsOpen = True THEN 
SEND ‘anticlockwise signal’ TO AVMOTORS #closes air vents 
SET airVentsOpen to False 
SEND ‘switch on signal’ TO AIRCON 
SET airConOn TO True 


END IF 
SET riversideOpen TO check_if_open() 
END WHILE 
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1(a) Order of precedence is key here. The student has used brackets 50 that the subtraction is done before 
the multiplication. Remember BIDMAS! This answer was awarded two marks. 


1(b) In this question there was one mark for getting the condition in the IF statement correct - ‘years > 10’ 
would also have been correct. The second mark was for correctly calculating the rent in both cases. 


2 One mark is awarded for each empty cell correctly filled in with the correct information, 


The candidate has given a full description of the process needed to identify the month with the highest 
number of visitors. 
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3(a) The student has clearly identified the error and has indicated haw it can be fixed but they 
haven't spelled it out. They should have given the correct code, i.e. SET useAgain TO ‘yes’. 


3(b) This explanation is far too vague. The candidate probably knew that the DIV operator returns 
the total number of times that one number divides into another without a remainder but should 


have spelled it out to be sure of getting two marks, 


3(c) One mark for using the append operator (ampersand) and one mark for using the correct 
variables in the output statement. 


3(d) Both calculations are correct. 


3(e) The student has jotted down a list of separate benefits without any Justification. A better 
answer would have been: 


The use of subprograms makes code clearer and easier to understand (1) because the main 
program is shorter and doesn't contain any repeated blocks of code (1) and because by putting a 
name to a block of code it is easier to see at a glance what it does. (1) 


4 The student has done a great job of completing the flowchart. Three marks are awarded 
for functionality. In other words, a solution that works. The other three marks are for 
correct notation. 


5 There's a lot of text at the start of this question, but don't be tempted to skip it. Read 
it carefully and underline key words, these are references to values that need to be stored 
and processed, 

5(a) The student achieved all six available marks for this question. 

5(b) The student has written very clear, readable code and has earned all nine marks - three for 
the functionality of the algorithm, three for the accuracy of the pseudo-code and three for the 
efficiency of the solution. 


This is a well thought-out response that includes keeping track of whether the heaters and air con 
units are on or off and whether the air vents are open or closed, 


A few more explanatory comments would have been helpful, including an explanation of the 
subprogram check_if_open. 


234 


Preparing for your exam 


Non-examined assessment - the project 


The non-examined assessment component is worth 20 per cent of your GCSE Computer Science qualification. 
It is a project set by Edexcel that involves analysing a problem and then developing a computer program in a 
high-level language to solve it. You will have 20 hours to work on the project in the final year of your course. 


You must complete the project under controlled conditions. This means that you are not allowed to seek help 
from anyone else and must only work on the project when supervised by your teacher. Use of the internet is 
not permitted. But you are allowed access to a clean copy of the pseudo-code booklet, a syntax guide for the 
programming language you are using and a flowchart stencil. 


There are four stages to the project: analysis, design, implementation, and testing, refining and evaluation. 


At the end of the project you must submit the program you have written, along with a report showing the 
development process and including an evaluation. 


Your work will be assessed by your teacher and moderated by Edexcel. 


Sample project with comments 
Here is a sample project that will give you an idea of what to expect. 


How well are we doing? 


A train company runs rail services between Buxton and Manchester. It collects data about each of the services it 
runs. The starting point, day and departure time of each service, along with the number of passengers carried 
and the delay (in minutes) are recorded and stored in a text file. All the data for a week for services starting 
from Buxton is stored in one text file. A second file holds the data for services starting from Manchester. 


An extract from a file is shown below. 


Services from Buxton — week beginning 30 November 2015 


Day Time Passengers _Delay (mins) 
Monday | 05.59 | 150 , “Se wee 
Monday 06.18 350 0) 
Monday 06.48 525 2 
Monday | 07.19 [814 “se 
"Monday 0744 —|902 a ae 
"Monday «(| 98.21 870 ; {5 
“Monday a 522, u 
Tuesday | 05.59 : _ | 158 1 
Tuesday | 06.18 409 as 
Tuesday — : 06.48 7 yee0 : oO 
Tuesday —for19 tise 2 


Train company managers want to use the data that is collected each week to find out the average number 
of passengers per day, the average service delay time and the average number of passengers travelling on a 
particular service, e.g. the 06.48 train. 


Your task is to analyse this problem and design, implement, test and evaluate a solution. 
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Stage 1 Analysis (6 masks) kati 


Begin by summarising the problem and listing 
each of the requirements. 


The program must: 


© Ask the user which data file they want to use, i.e. which week and which station starting poine. 
¢ Read in the service data from a text file. 

* Offer users a menu of options to choose from. 

¢ Calculate and output the average number of passengers per day. 

¢ Calculate and output the average service delay. 

¢ Calculate and output the average number of passengers for a particular service. 


Decompose the problem into a set of sub-problems. Briefly describe each sub-problem. Write a short 
explanation of why you have decomposed the problem in this way. 


I have decomposed the problem into a set of sub-problems that cover all of the specified requirements, 


Data import: The data from the text file must be read in and stored in an appropriate data 
structure, In the course of a year 104 text files will be created. The user will need to specify which 
one they want to use. Currently there is no requirement to compare data for different weeks. 


Menu: The program must provide the user with three menu choices — 
(1) calculate daily average passenger numbers, 

(2) calculate average delay, and 

(3) calculate average passenger numbers for a specified service. 


It must also give the user the option of exiting the program. User input must be validated so that 
inappropriate data entry does not cause the program to crash. 


Average number of passengers: The program must compute the average number of passengers per day. 
To do this it needs to calculate the number of passengers who travelled each day and divide by the 
number of services on that day. The results should be displayed as a table with two columns — Day 
and Average number of passengers. 


Average delay: The program must calculate the average service delay. To do this it needs to compute 


the total amount of time taken up by delays in a given week and divide by the total number of 
services in that week. 


Service average: The program must prompt the user to select a particular service and compute the 
average number of passengers who travelled on that service in the given week. 


Each of the three menu options requires an average to be calculated. I plan to write a Subprogram that 
returns the average of a set of values that is passed to it. This will avoid code duplication and maximise 
the efficiency of the solution. 


I have broken the problem down in this way so that each sub-problem focuses on a specific 
requirement, For example, the sub-problem ‘Average number of passengers’ deals with the 
requirement to calculate and display the average number of passengers per day. 


When I move on to design my solution I will write a subprogram to solve each sub-problem, 
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Stage 2 Design (18 marty) 
The algorithm (12 marks) ok) —————— ae 


Now it is time to design your solution - 
fl S ue eee Pee tielegeecs as Logic errors can easily creep in during the 
a flowchart or a combination of both to express your algorithm. 
design stage and will cost you dearly if you 


Aim to design a subprogram to solve each of the sub-problems don’t spot them. 

| you identified in the analysis stage. Make sure you show how the To get full marks for your design you 

| subprograms link together. must make sure that there are no 
Study the data in the text file carefully and devise a suitable data errors in the logic. You also have to use 
structure to accommodate it. appropriate programming constructs 


in your algorithm and fully decompose 
the solution into subprograms. Not 
forgetting of course that your design 
Here is part of the algorithm for the service analysis program. must meet all the requirements! 


Don't forget to select meaningful identifier names for variables, 
data structures and subprograms. 


READ service _datal51130.txt Records 


PROCEDURE Menu() 

BEGIN PROCEDURE 
SEND ‘Please select one of the following options:’ TO DISPLAY 
SEND ‘1: Calculate average number of passengers’ TO DISPLAY 
SEND ‘2: Calculate average delay:’ TO DISPLAY 


SEND ‘3: Calculate the total number of passenger within a time period:’ TO 
DISPLAY 


SEND ‘4: Exit’ 
RECEIVE Choice FROM (Integer) KEYBOARD 
IF Choice >= 1 AND Choice <=4 THEN 
IF Choice = 1 THEN 
avgPassengers() 


BND IF 

IF Choice = 2 THEN 
avgDelay({) 

MAD IF 


IF Choice = 3 THEN 
avgService( ) 
END IF 
IF Choice = 4 THEN 
exitProgram 
ELSE 
SEND “Please select a valid option” TO DISPLAY 
Menu( ) 
END IF 
END PROCEDURE 


Complete the design for the service analysis program. _ 
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Test strategy and initial test plan (6 marks) 


As part of the design stage you must devise a strategy for testing your finished program. Your aim is to ensure 
that the program works and meets all the requirements. 


Don’t fall into the trap of only testing that your program works when valid data is entered. You should 
use normal, boundary and erroneous test data to ensure that your program runs correctly in all 
circumstances. Don’t forget you'll need to test that the subprograms interact as intended. 


Try to make your test plan as comprehensive as possible. It might help to ‘walk through’ your solution mentally, 
jotting down anything that needs testing as you go along. Pose ‘what if’ questions as key points. 

It’s not sufficient to have a strategy in your head. You need to summarise what your tactics are and explain 
why you think this approach will work. 


Here is a partially completed test plan. 


Test Purpose of __ Test data | Expected result Actual —- Action needed/ 
no ___ the test | Result comments 
1 | To ensure 1 | avgPassengers 
that only valid | subprogram called | 
menu choices | 5 avgDelay subprogram 
are accepted @illed | | 
3 avgService subprogram 
: _ | called 
4 | exitProgram | | | 


subprogram called 
+o 


0, ‘a’, 99 Invalid choice message 
displayed and user 
asked to re-enter their 
choice 


2 | To ensure that | The test file 
the average _| service_data151130. 
number of txt 
passengers 
per day is 
correctly 
calculated 


Copy and complete the test plan. Ensure the tests cover all 
aspects of the planned program. 
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Stage 3 Implementation (14 mary) 


If you've made a good job of the design, implementation should be easy. 


Your aim is to produce a fully functioning, error- 
free solution that addresses all the requirements 
of the problem. Your program should be fully 
decomposed into subprograms and the code 


Implement the solution you designed in the 
high-level programming language you are 
studying. Add comments to your program so 

that it's easy for other people to understand how 


‘it works. should be clear and easy to understand. 
Stage | Testing, refining and Examtip 
evaluation (42 marty) Make testing an integral part of the development 
process and don't leave it to the end. It's important to 
Time to put your test strategy into action and run the log any tests, outcomes and fixes that you carry out. 
you think of other tests you need to do. remember to test again once you've finished 


correcting them, just to make sure that you haven't 
introduced any new ones. 


Carry out your planned test strategy, filling in the For full marks it’s not enough simply to carry out 
remaining columns of the test plan as you do so._ the tests, you need to record the outcome in your 
test plan. 


The final step is to evaluate your solution against 
each of the requirements that you identified in the 
analysis stage of the project. 


If you do identify any errors you must show that 
you corrected the program to overcome them. 


Write an evaluation of your solution. Critically Write a critical evaluation of how the program code 
assess how successfully it meets the specified meets the requirements of the task, rather than 
requirements. just a commentary. 

The report 


You will need to gather evidence of the work you have done during the project and submit it as a report. This is 
what should be included. 


Stage 1 - Analysis |* A brief summary of the problem 
¢ A list of the requirements of the problem 
¢ A list of the sub-problems you have identified, together with a brief description of 
each one 
¢ Ashort explanation of your rationale for breaking down the problem in the way you have 


Stage 2 - Design ¢ Your algorithms 
e A brief description of your test strategy 
¢ Your initial test plan with the first four columns filled in 


Stage 3 - A copy of your program code 
Implementation A couple of screenshots showing debugging in action 
Stage 4 - ea: * The completed test plan with all six columns filled in and extra lines added if required 


refining and |* The evaluation of your program 
evaluation 
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Abstraction: the process of removing or hiding unnecessary 
detail so that only the important points remain. 


Access control: this determines which users have access 
to which data, and what they are allowed to do with it. 


Algorithm: a precise method for solving a problem. It 
consists of a sequence of step-by-step instructions. 


Analogue: using signals or information represented 

by a quantity (e.g. an electric voltage or current) that is 
continuously variable. Changes in the information being 
represented are indicated by changes in voltage. 

This method requires very accurate sending and receiving 
mechanisms. 

Application software: software that performs a task that 
would otherwise be done by hand, perhaps with pen 

and paper. 

Arithmetic/logic unit (ALU): the part of the CPU that 
performs calculations and logic operations. 

Arithmetic operator: an operator that performs a 
calculation on two numbers. 

Arithmetic shift: used for signed binary numbers. When 
performing a right shift the bits at the left are replaced by 
copies of the most significant bit. 

Array: an organised collection of related values that share 
a single identifier. 

Ascending order: this is arranging items from smallest to 
largest (e.g. 1, 2,3, 4,5,6o0ra,b,c,d,e, f). 

Assembly language: a low-level language written using 
mnemonics. 

Assignment statement: the SET...TO command is used to 
initialise variables in pseudo-code, for example: 

SET anotherGo TO 0 

SET correct TO" False 


Attribute: an item of data stored about an entity. 


Audit trail: a record of activities that have taken place on 
a computer system. This record is generated automatically 
and will record what has happened and who or what made 
the change. 


Authentication: the process of proving to a computer 
system who you are (e.g. using a username and password). 
Back-up: a copy of files in another location so that they 
are still available if the original copy is damaged or lost. 
Backing up is the process of making a back-up copy. 


Bandwidth: the amount of data that can be carried ona 
network in a given period of time. 


Binary: information represented by only two values (e.g. a 
voltage or no voltage, on or off). 

There are no communication errors or misunderstandings 
because there are no small differences. 


Binary digit or bit: the smallest unit of data that is 
represented in a computer. It has a single binary value, 
either 1 or 0. 


Binary prefix: multiplies a unit by powers of 2. 


Binary shift: an operation done on all the bits of a binary 
value in which they are moved by a specific number of 
places to either the left or right. 

Bit depth: the number of bits used to encode the 
information from each sample. Increasing the number 

of bits used increases the amount of detail contained in 
each sample. 

Boolean: something that can take only the values True or 
False; named after English mathematician George Boole. 


Brute force: an algorithm design that does not include 
any techniques to improve performance, but instead relies 
on sheer computing power to try all possibilities until the 
solution to a problem is found. 

Built-in functions: functions that are provided in 

most high-level programming languages to perform 
common tasks. 

Bus: a group of connections between devices in 

a computer. 

Bus width: the number of wires that make up a bus - 
this determines the range of binary numbers that can 

be transmitted. 

Byte: the basic combination of bits used to represent an 
item of information. A byte typically consists of 8 bits. 
Cache: memory used to make up for the difference in 
speed between two internal components. 

Carbon footprint: the amount of carbon dioxide an 


individual or organisation produces as a result of the 
energy they consume. 


Central processing unit (CPU): hardware device that 
carries out the processing in a computer. 


Character set: the defined list of characters recognised by 
a computer's hardware and software. 
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Checksum: an error detection technique. A mathematical 
formula is applied to the data and the resulting numerical 
value is transmitted with the data. The recipient computer 
applies the same formula to the received data and 

then compares the checksum sent with the data to the 
calculated checksum. If the checksums don’t match the 
data is likely to have been corrupted and the recipient 
computer requests the data again. 


Client-server network: a network that has at least one 
server to provide services to the client computers. 


Clock: an electronic device inside a CPU that ‘ticks’ at 
regular intervals and is used to synchronise the actions of 
the other parts of the CPU. 


Cloud storage: secondary storage, often belonging to 
a third party, that is accessed via a network, usually the 
internet, and so is not in the same physical place as the 
machine's RAM/ROM. Files stored ‘in the cloud’ can be 
accessed from anywhere via an internet connection. 


Code vulnerability: a computer program (the code) that 
has been written in such a way that it creates a security 
issue that may be taken advantage of to gain access to the 
computer system or data within it. 

Colour depth: the number of bits used to encode the 
colour of each pixel. 

Communication media: the means by which data is 
transmitted between devices on a network. Coaxial 

cable, fibre-optic cable and microwaves are all forms of 
communication media. 


Compiler: a translator that converts high-level language 
source code into object code, often machine code. The 
source code is translated all at once and saved to be 
executed later. 

Compound key: a key that consists of two or more fields 
used to identify a record uniquely. 

Compression: changing the format of a data file so that 
the size of the file becomes smaller. 

Computational thinking: the thought processes involved 
in formulating problems and their solutions so that the 
solutions are represented in a form that can be effectively 
carried out by a computer. 

Computing technology: an all-encompassing term 
referring to the hardware, software and infrastructure that 
underpin current and emerging computer systems. 
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Concatenation: the linking together of two or more items 
of information. 


Concurrent: processes that run apparently at the same 
time are described as being concurrent. 


Constant: a ‘container’ that holds a value that never 
changes. Like variables, constants have unique identifiers. 


Construct: a component from which something is built. 
Letters and numbers (i.e. a to z and 0 to 9) are the constructs 
we use to build our language and convey meaning. Bricks 
and cement are the basic constructs of a building. 


Control unit: the part of the CPU that organises the 
actions of the other parts of the CPU. 


Creative Commons: an organisation that allows people to 
set copyright terms for their intellectual property. One use 
of a Creative Commons licence is to allow people to copy 
material as long as it is not used commercially. 


Cyberattack: any kind of malicious attack on a network- 
connected device. 


Data structure: an organised collection of related 
elements. Arrays and records are two common data 
structures used in programming. 


Data type: specifies what kind of data it can hold. Common 
data types are integer, real, Boolean and character. The 
data type of a value determines the operations that can be 
performed upon it. 


Database: an organised store of data. 
Decimal prefix: multiplies a unit by powers of 10. 


Decomposition: breaking a problem down into smaller, 
more manageable parts, which are then easier to solve. 


Definite iteration: this is used when the number of 
iterations, or turns of the loop, is known in advance. It can 
be set to as many turns as you want. This sort of loop is 
said to be count controlled. 


Defragmenter: a utility that moves file clusters on a 
disk so they are closer to each other in order to speed up 
disk access. 


Denial of service (DoS): an attack on a network that 
attempts to prevent legitimate users from accessing 
its services. 


Descending order: this is arranging items from largest to 
smallest (e.g. 6, 5, 4, 3,2, 1 or f,e, d, ¢, b, a). 


Glossary 


Digital: information represented by certain fixed values 
(e.g. high, medium or low). Any signal between these values 
would be meaningless and not used. 


Sending and receiving mechanisms do not have to be as 
accurate as for analogue communication. 


Digital divide: the gap between people who are 
technology-empowered and those who are technology- 
excluded. 


Digital inclusion: ensuring that everyone has affordable 
access to computing technology and the necessary skills to 
take advantage of it. 


Divide and conquer: an algorithm design that works 
by dividing a problem into smaller and smaller sub- 
problems, until they are easy to solve. The solutions 
to these are then combined to give a solution to the 
complete problem. 


e-waste: any form of discarded electronic equipment, 
including computing technology. 


Eavesdrop: having unauthorised sight of data being sent 
from one computer to another over a network. 


Encryption: the process of encoding a message into 

a form that only the intended recipient can decode, or 
decrypt, and read. The message is encoded using an 
agreed method or algorithm. This is called the key. The 
encrypted message is called a cipher. 


Entity: something that is recognised as being capable of 
an independent existence, is uniquely identified and about 
which data is stored. 

It is usually a physical object (e.g. a car, a person or a book), 
but it can be a concept (e.g. a customer transaction where 
details of the items that were purchased are stored). 
Ethics: a set of moral principles that govern a person's 
behaviour. 

Execution: the process by which a computer carries out 
the instructions of a computer program. 
Fetch-decode-execute cycle: sequence of steps carried 
out repeatedly by a CPU. 

Field: an individual element in a record. 

File: all of the records in a table or a group of linked tables. 


Firewall: a utility that controls program access to the 
network, both incoming and outgoing. 


Flowchart: a graphical representation of an algorithm. 
Each step in the algorithm is represented by a symbol. 
Symbols are linked together with arrows showing the order 
in which steps are executed. 


Foreign key: a field in one table that uniquely identifies 
records in another table (i.e. it is a primary key in 
another table). 


Function: a function is a subprogram that performs a 
specific task and can be used at any point in the program. 
High-level programming languages have a number of 
useful built-in functions. You can also create your own or 
use functions available in online libraries. 


Global variable: a variable that can be accessed from 
anywhere in the program, including inside subprograms. 


Hacking: the act of gaining unauthorised access to a 
computer system and the data it contains. 


Heuristic: a type of algorithm capable of finding a solution 
to a problem quickly and easily, by using a combination of 

trial and error and educated guesswork to cut corners and 

eliminate less likely alternatives. 


Hexadecimal: a base-16 number system. There are 16 
digits and the place values increase in powers of 16. 
High-level programming language: a programming 
language that resembles natural human language. 
HyperText Markup Language (HTML): essentially a text 
document that contains any text to be displayed along 
with: 
* details of how the text should be formatted (e.g. font size, 
colour etc,); 


* details of any hyperlinks and where they link to using 
a URL; 


* details of any objects such as pictures or videos that 
should be shown within the web page when it is 
displayed. 

Identifier: a unique name given to a variable or a constant. 

Using descriptive names for variables makes code much 

easier to read. 


Identity theft: the stealing of another person’s personal 
details, such as their bank account number, sort code or 
passport number, for the purpose of making purchases 
and running up debts in their name. 
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IF... THEN...ELSE statement: the IF... THEN...ELSE 
statement allows a choice to be made between two 
alternatives based on whether or not a condition is met 
(e.g. IF it is cold THEN wear a jumper ELSE wear a T-shirt). 


Indefinite iteration: this is used when the number of 
iterations is not known before the loop is started. The 
iterations stop when a specified condition is met. This sort 
of loop is said to be condition controlled. 


Infinite loop: a loop that is never-ending since the 
condition required to terminate the loop is never reached. 


Initialisation: the process of assigning an initial value to 
a variable. 


Input: to enter data into a computer. 


Instruction set: the list of all possible commands a 
particular CPU knows how to carry out. 

Integer: a whole number (e.g. 3, 6, 9). 

Integrated Development Environment (IDE): a package 
that helps programmers to develop program code. It has a 
number of useful tools, including a source code editor and 
a debugger. One of the most useful features of an IDE is 
the debugger. One of its tasks is to flag up syntax errors in 
the code and issue helpful error messages. 

Intellectual property (IP): a creation of the human mind 
that is unique and has a commercial value. 

Internet: a worldwide system of interconnected networks 
that enables information to be exchanged and shared. 
Internet Service Provider (ISP): an organisation that 
provides its customers with a connection to the internet. 
Interpreter: a translator that converts high-level 
language source code into object code, often machine 
code. The source code is translated and executed one line 
at a time. 

Iteration: a construct that means the repetition of a 
process. An action is repeated until there is a desired 
outcome or a condition is met. It is often referred to as 

a loop. 

Local area network (LAN): a network that covers a 
relatively small geographical area, often a single site. 
Local variable: a variable that is accessed only from within 
the subprogram in which it is created. 

Location-based services: services that enable people to 
access and share real-time location information online. 
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Logic circuit: an electronic circuit that has inputs and 
outputs that follow one of the Boolean operators. 


Logic error: an error in an algorithm that results in 
incorrect or unexpected behaviour. 


Logical operator: a Boolean operator using AND, OR 
and NOT. 


Lossless compression: compressing a file in such a way 
that it can be decompressed without any loss of data. 


Lossy compression: compression where some of the data 
is removed; the original file cannot be restored when the 
lossy file is decompressed. 

Low-level programming language: a programming 
language that is closely related to the CPU's machine code. 
Machine code: the binary codes representing each of the 
instructions in the instruction set. 


Magnetic storage: secondary storage that works by 
magnetising parts of a substance as north and south poles 
to represent binary 1s and Os. 


Main memory/random-access memory (ROM): a 
temporary store for data and instructions (programs). 


Malware: short for ‘malicious software’. It is used as a 
generic term for any kind of software that is designed to 
disrupt the use of a computer system. 


Median: the middle number when the numbers are put 
in ascending or descending order (e.g. if there are 13 
numbers, then the 7th number is the median). If there 
are an even number of items in an ascending list, choose 
the item to the right of the middle (e.g. if there are 10 
numbers, then choose the 6th as the median). 


Memory address: a number that uniquely identifies a 
(memory) storage location. 


Mnemonic: a short, simple, acronym that represents each 
of the instructions in a CPU's instruction set, e.g. LDR (load 
register), STR (store) and CMP (compare). 

Modular testing: testing each block of code as it is 
completed to ensure the code works as expected. 

Monte Carlo methods: carrying out a statistical analysis 

of a number of random samples in order to obtain 
approximate solutions to a problem. The larger the number 
of samples used, the more accurate the result is likely to be. 
Most significant bit (MSB): the bit with the highest value 
ina multiple-bit binary number. 
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Nested IF statement: a nested IF statement consists of 
one or more IF statements placed inside each other. A 
nested IF is used where there are more than two possible 
courses of action. 


Nested loop: a loop that runs inside another loop. The 
inner one executes all of its instructions for each turn of 
the outer loop. 


Network: an arrangement of computers and other devices 
connected together to share resources and data. 


Network security: activities designed to protect a 
network and its data from threats such as viruses, hacker 
attacks, denial of service attacks, data interception and 
theft, and equipment failure. 


Network topology: describes how the devices ona 
network are connected together. 


Neural networks: processing information in a similar way 
to human brains and learning and adapting over time. This 
makes them useful tools for solving pattern recognition 
problems, such as facial recognition, medical diagnosis 
and quality control, which computers are normally not 
very good at. 


Non-volatile: memory that is not lost when the power is 
turned off. 


Object code: the translated source code. Often this will be 
machine code, but might also be an intermediate code, 
which has to be further translated before it can be executed. 


Open-source software: software that is free to obtain, 
edit and redistribute. 


Operating system: software designed for particular 
hardware and which manages other programs’ access to 
the hardware. 


Operator precedence: the order in which you apply the 
operators (including logical operators) in a mathematical 
equation. 


Optical storage: secondary storage that works using 
differences in light reflection from a material. 


Output: to display or output data that has been processed 
(or has been stored). 


Packet: a small quantity of data being sent through a 
network. The packet is labelled with the sender's address 
(source), the recipient's address (destination), how many 
packets are being transmitted and the position of this 
packet in the complete message. 


Paging: the algorithm the OS uses to move programs 
from RAM to disk and back again when needed once main 
memory is full. 


Parameters: values that are passed into a subprogram 
when it is called. 


Patent: an exclusive right granted to an inventor to make, 
use and sell an invention for a fixed period of time. 


Peer-to-peer network: a network that doesn't have any 
centralised servers. Each computer in the network can act 
as client and server. 


Personal data: information that is personal and unique to 
an individual. 


Phishing: attempting to get confidential information by 
sending a user a message that appears to come froma 
trustworthy person or organisation. 


Physical security: controlling access to critical parts of a 
network using physical methods (such as locked doors) 
rather than software. 


Pixel: short for ‘picture element’, the smallest single point 
of colour in a graphic image. 


Primary key: a unique identifier for each record in the 
table. It is usually a field that is guaranteed to hold unique 
information for each record. 


Privacy: the right to be left alone and free from unwanted 
scrutiny and intrusion. 


Procedure: a subprogram containing a set of statements 
that are executed when the procedure is called. Unlike 

a function, a procedure does not return a value to the 
main program. 


Process: to change the meaning or format of some data. 


Professionalism: the skill and competence expected of a 
person in a professional setting. 


Proprietary software: software that belongs to an 
individual or a company. Its licence specifies that users 
are not allowed to modify the source code and places 
restrictions on its use. 


Protocol: a set of rules that govern how communications 
on a network should be formatted and what data they 
should include. 


Pseudo-code: a structured, code-like language that can be 
used to describe an algorithm. 
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Random number: a number within a given range of 
numbers that is generated in such a way that each 
number in the range has an equal chance of occurring. 


There are many devices for generating random numbers. 
A die is used in games to get a random number from 1 
to 6. Computer programming languages have a function 
for generating random numbers across variable ranges. In 
the Edexcel pseudo-code there is a useful built-in 
RANDOM command. 

RANDOM (upperLimit) 


For example, number = RANDOM(6) would generate a 
random number from the numbers 1 to 6. 

Read-only memory (ROM): memory that cannot be 
altered and is not lost when the power is turned off. 
Reading: when the CPU retrieves the data stored at a 
given address. 

Record: a data structure that stores a set of related values 
of different data types. 

Recursion: a process that is repeated. For example, a 
document can be checked and edited, checked and edited 
and so on until it is perfect. 

Register: a storage location inside the CPU used to hold 
an instruction, an address or other single item of data. 
Relational database: a database that allows data 
elements in one table to be related to any piece of 

data in another table as long as both tables contain a 
common element. 

Relational operator: an operator that compares two 
values. 

Resolution: the number of pixels per inch when the image 
is displayed (for example, on a monitor or on paper). 
Router: a piece of networking hardware used to forward 
packets of data from one network to another. 

Runtime error: an error that occurs while the program 

is running — the operation the computer is asked to do is 
impossible to execute. 

Sampling: taking measurements of the sound wave at 
regular but distinct intervals of time (e.g. 44,100 samples 
per second). 

Scheduling: the algorithm that the OS uses to allow each 
running process to use the CPU. 

Scope: the region of code within which a variable is visible. 
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Secondary storage: any kind of permanent storage to 
which the contents of ROM/RAM are copied (usually a hard 
disk, optical or solid-state device). 


Selection: a construct that allows a choice to be made 
between different alternatives. 


Sequence: an ordered set of instructions. 


Shoulder surfing: gaining access to information by 
watching someone enter it into a computer system. 


Simulation: a representation of a real-world process 
or system. 


Social engineering: any kind of attack on a computer 
system or network that exploits how people behave and 
respond to certain situations. 


Software: the set of programs run by a computer system. 


Solid-state storage: secondary storage that works by 
storing charge (electrons). 


Source code: the text of the program that a programmer 
writes. 


Spyware: software, possibly a virus, that is designed to 
be installed secretly on a computer and record private 
information as the user enters it. 


String: a sequence of characters. They can be letters, 
numbers, symbols, punctuation marks or spaces. 


String traversal: using a loop to cycle through each 
character in a string. 


Subprogram: a self-contained module of code that 
performs a specific task. It can be ‘called’ by the main 
program when it is needed. 


Surveillance technology: CCTV, drones, number plate 
recognition, bugging and tracking devices used to 
monitor and record people's activities, often without their 
knowledge. 

Syntax error: an error that occurs when a rule of the 
programming language is broken. 

Table: a collection of cells organised in rows and columns 
used to store data in a structured and organised manner. 
Text file: a sequence of lines, each of which consists of 

a sequence of characters. 

Trace table: a technique used to identify any logic errors 


in algorithms. Each column represents a variable or output 
and each row a value of that variable. 


Glossary 


Translator: a program that converts source code into 
machine code. 


Traversal: travel across or through something. An array 
can be traversed by moving from the first to the last 
element in order to examine the data stored at each 
index position. 


Truth table: a table showing all possible combinations 
of the inputs and outputs of an operator. 


Two-dimensional array: a matrix of rows and columns 
resembling a table. Two indices are used, one to 
reference the rows and the other the columns. All the 
elements of a two-dimensional array share the same 
data type. 


Type coercion: the process of converting the value 
stored in a variable from one data type to another. 


Unambiguous: this means that the instructions 

cannot be misunderstood. Simply saying ‘turn’ would 
be ambiguous because you could turn left or right. All 
instructions given to a computer must be unambiguous 
or it won't know what to do. 


Unpatched software: software that hasn't had the 
latest security updates applied to it, making it vulnerable 
to attack. 


USB: Universal Serial Bus sockets found on most modern 
computer systems. 

User interface: the way the user interacts with the 
operating system 

Utility software: software that does a useful job for the 
user that is not essential to the operating system and 
not the reason for using a computer in the first place. 


Validation: to check that the data entered by a user or 
from a file meets specified requirements. , 


Variable: a ‘container’ used to store data. The data stored 
in a variable is referred to as a value. The value stored ina 
variable is not fixed. The same variable can store different 
values during the course of a program and each time a 
program is run. 

Virtualisation: any process that hides the true physical 
nature of a computing resource, making it look different, 
usually to simplify the way it is accessed. 


Virus: software that is designed to make and distribute 
copies of itself, usually for a malicious purpose. 


Volatile: memory that is erased when the power Is 
turned off. 


von Neumann architecture: computer system design in 
which the program is stored in memory with the data. 


Web server: powerful computer systems that store web 
pages and any multimedia that the pages might contain. 


‘What if?’ question: running a computer model with a 
given set of inputs to see what the model produces as an 
output or prediction. 


Whistle-blower: someone who draws attention to the 
activities of an organisation or person believed to be 
acting illegally or unethically. 


Wide area network (WAN): a network that covers a large 
geographical area. It connects together two or more LANs 
and is usually under collective ownership. The largest 
wide area network is the internet. 


Wireless local area network (WLAN): a local area 
network in which connected devices use high frequency 
radio waves to communicate. 


Writing: when the CPU sends data to memory to be 
stored at a given address. 
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thinking. 
Other key features of the Student Book include: 


- in-depth coverage of the core content you need for all 


components of your course 
* a‘Preparing for your exam’ section with tips and guidance 
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for achieving success in your written exams 
- ‘Exam-style questions’ and ‘Exam tips’ throughout to give 
you plenty of practice at answering the kind of questions 
you might find in the exam 
* ‘Checkpoint’ features to check your knowledge and 
challenge your understanding 
- ‘Worked example’ features to help you understand how to 
carry out practical skills before you try an activity yourself. 
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